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FOREWORD 


The  twelfth  ‘Aha  Huliko’a  Hawaiian  Winter  Workshop,  was  held  from 
January  16  to  19,  2001  in  Honolulu,  Hawaii.  Its  topic  was  “From  stirring 
to  mixing  in  a  stratified  ocean.” 

Stirring  and  mixing  are  the  physical  processes  that  convert  variance  from 
the  eddy  to  the  molecular  scale  and  that  may  need  to  be  parameterized  in 
ocean  models.  This  scale  range  is  highly  complex  and  contains  many 
physical  processes.  The  workshop  focused  especially  on 

(1)  The  parameterization  of  mesoscale  eddies 

(2)  General  approaches  to  stirring,  including  the  application  of  ideas 
from  dynamical  systems  theory 

(3)  Inertial  instability,  submesoscale  motions,  and  vortical  motions 

(4)  The  interplay  of  isopycnal  and  diapycnal  processes 

(5)  Processes  in  the  surface  mixed  layer 

(6)  The  stirring  and  mixing  of  biologically  active  tracers 

(7)  Mixing  efficiency,  i.e.  the  fraction  of  energy  lost  from  the  mean 
flow  that  produces  a  vertical  buoyancy  flux,  and 

(8)  Differential  mixing  of  heat  and  salt. 

There  is  a  need  not  to  just  understand  these  processes,  but  also  to 
parameterize  them  in  a  form  that  can  be  implemented  in  ocean 
circulation  models. 

The  lectures  of  the  workshop  are  published  in  these  proceedings.  The 
order  of  the  papers  loosely  follows  the  agenda  of  the  workshop,  covering 
observationSi  theoretical  analyses,  and  numerical  modeling  results.  Also 
included  is  a  summary  of  the  workshop. 

The  workshop  was  supported  by  the  Office  of  Naval  Research  grant 
number  NOOO 14-00- 1-0 168.  It  was  hosted  by  the  Department  of 
Oceanography,  School  of  Ocean  and  Earth  Science  and  Technology, 
University  of  Hawaii.  The  excellent  facilities  and  the  capable  staff  of  the 
Imin  Conference  Center  contributed  greatly  to  the  success  of  the 
meeting.  The  local  organization  and  logistical  arrangements  were 
expertly  handled  by  Ms.  Sharon  Sakamoto.  This  proceedings  volume 
came  into  existence  through  the  creative  and  dedicated  research  of  the 
scientists  who  gathered  in  Hawaii  and  provided  the  articles  that  follow. 
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^  'Aha  Huliko'a  is  a  Hawaiian  phrase  meaning 
an  assembly  that  seeks  into  the  depth  of  a 
matter. 
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Stirring  and  mixing:  What  are  the  rate-controlling 
processes? 

Chris  Garrett 

Department  of  Physics  and  Astronomy,  University  of  Victoria,  Victoria, 

British  Columbia,  Canada 


Abstract.  The  parameterization  of  unresolved  processes  in  oceanographic 
models  is  complicated  by  the  interplay  of  processes  on  a  wide  variety  of  space 
and  time  scales.  The  lack  of  spectral  gaps  further  complicates  the  situation, 
though  perhaps  not  as  seriously  as  might  be  feared.  Understanding  the 
interaction  of  different  processes,  and  determining  which  one  is  critical,  or 
rate-controlling,  may  be  aided  by  thinking  in  terms  of  a  triple  decomposition 
into  mean,  eddies  and  turbulence.  Particular  physical  processes  reviewed 
include  the  fate  of  energy  released  in  baroclinic  instability  and  the  ultimate 
thermohaline  frontolytic  mechanism  on  mean  isopycnal  surfaces.  Both  of 
these  issues  may  have  implications  for  diapycnal  mixing  rates.  Another 
important  question  reviewed  briefly  is  that  of  the  efficiency  of  conversion  of 
turbulent  kinetic  energy  to  mean  potential  energy;  we  do  not  know  what 
“external”  parameters  determine  it.  The  kinematic  details  of  overturning  are 
also  discussed  in  terms  of  the  probability  distribution  of  displacements  from 
a  stably  stratifled  buoyancy  profile. 


1.  Introduction 

The  behavior  of  the  ocean  is  affected  by  a  variety  of 
processes  occurring  at  space  and  time  scales  that  are  too 
small  to  be  resolved  explicitly  in  models.  The  influence 
of  these  processes  then  needs  to  be  parameterized  in 
terms  of  variables  that  are  explicitly  included  in  the 
models. 

Following  a  “Reynolds  decomposition”  of  variables 
into  a  (slowly  changing)  mean  and  fluctuations,  we 
therefore  seek  a  representation  of  the  eddy  fluxes  u[C' 
for  a  scalar  C  and  for  momentum,  where  the  prime 
superscript  denotes  the  fluctuation.  For  the  scalar  C  it 
is  usual  to  assume  that  the  eddy  flxix  is  related  by  a 
tensor  to  the  local  gradient  of  the  mean  C,  as  would 
be  appropriate  in  a  mixing  length  theory  in  which  the 
fluctuations  are  related  to  particle  displacements  that 
are  small  compared  with  the  distance  over  which  the 
mean  changes  significantly.  Thus 

^=-TijdC/dx^.  (1) 

The  symmetric  part  Kij  of  Tij  is  diagonalizable  and 
is  likely  to  represent  down-gradient  diffusion  parallel  to 
the  principal  axes  of  the  tensor.  The  antisymmetric 
part  Sij  of  Tij  has  an  associated  “skew  flux”  given 


by  _  _ 

Fsi  =  -SijdCjdxj  =  (D  X  VC)i  (2) 

wher^D  =  —(5235531,512).  This  flux  is  perpendicular 
to  VC  and  may  be  written  as 

Fs  =  -(V  X  D)C  -}-  V  X  (DC).  ‘  (3) 

The  second  term  of  this  is  non-divergent  and  so  does 
not  affect  the  evolution  of  C.  The  first  term  is  advective 
with  a  velocity  U5  which  may  be  written 

Usi  =  dSij/dxj.  (4) 

This  standard  formalism  (e.g.,  Rhines  and  Holland 
1977;  Moffatt  1983;  Middleton  and  Loder  1989)  is  purely 
kinematic.  In  practice,  however,  it  seems  likely  that  Kij 
describes  large  mixing  rates  along  mean  isopycnals  and 
a  very  much  smaller  diapycnal  mixing  rate,  whereas 
is  related  to  the  difference  between  Lagrangian  and  Eu- 
lerian  mean  flows.  Values  for  Kij  and  U5  appropriate 
for  the  oceans  in  their  present  state  could  conceivably 
be  obtained  from  appropriate  and  sufficient  observa¬ 
tions.  However,  extrapolation  of  limited  measurements 
and  the  requirements  of  models  that  seek  to  be  pre¬ 
dictive  for  different  ocean  states  require  that  we  obtain 
formulae  for  Kij  and  in  terms  of  resolved  variables. 
This,  in  turn,  requires  that  we  understand  the  processes 
responsible  for  the  fluxes. 
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Many  problems  immediately  arise.  One  is  the  va¬ 
lidity  of  a  representation  of  the  fluxes  in  terms  of  local 
mean  gradients.  There  au'e  clearly  examples  in  the  ocean 
in  which  ocean  properties  are  advected  by  coherent  ed¬ 
dies  over  larger  distances  than  those  over  which  mean 
gradients  are  reasonably  constant.  This  is  likely  to  be 
even  more  of  a  problem  for  momentum,  which  may  be 
carried  long  distances  by  waves.  Such  situations  call 
for  diflFerent  forms  of  parameterization  but  will  not  be 
considered  here. 

Another  fundamental  difficulty  is  associated  with  the 
decomposition  into  mean  and  fluctuations.  This  es¬ 
sentially  assumes  that  there  is  a  spectral  gap  of  some 
sort  between  the  mean  and  fluctuations,  allowing  an 
assumption  that  the  mean  changes  little  over  a  time  or 
space  scale  large  enough  for  the  determination  of  statis¬ 
tically  accurate  eddy  fluxes.  This  important  issue  will 
be  discussed,  albeit  naively,  in  Section  2. 

A  further  basic  issue  concerns  the  extent  to  which 
different  ocean  processes  are  in  parallel  or  in  series. 
If  in  parallel,  which  is  dominant?  If  in  series,  which 
one  controls  the  eddy  flux,  leaving  the  other  just  to 
do  what  it  has  to?  This  will  be  discussed  in  Section 
3,  followed  by  a  discussion  in  Section  4  of  the  possi¬ 
ble  insights  to  be  gained  from  a  “triple  decomposition” 
into  mean,  eddies  and  turbulence.  Section  5  addresses 
the  inter-relationship  of  isopycnal  stirring  and  diapyc- 
nal  mixing,  and  Section  6  speculates  on  the  factors  that 
may  determine  the  “mixing  efficiency’  with  which  me¬ 
chanical  energy  input,  such  as  that  from  breaking  inter¬ 
nal  waves,  is  converted  to  an  increased  potential  energy 
of  the  mean  state.  Some  clues  on  this  may  come  in  the 
future  from  careful  analysis  of  overturning  motions  in 
the  ocean,  so  some  recent  ideas  and  results  on  this  topic 
are  summarized  in  Section  7. 

2.  The  Spectral  Gap 

At  mid-latitudes,  internad  waves  may  overlap  in  spa¬ 
tial  scale  with  other,  quasi-geostrophic,  motions,  but 
have  different  time  scales.  Thus,  in  determining  the  ef¬ 
fect  of  the  former  on  the  latter,  one  could  presumably 
exploit  the  existence  of  a  spectral  gap  in  the  frequency 
domain.  Near  the  equator  the  internal  wave  time  scales 
may  be  longer  and  overlap  with  the  time  scales  of  other 
motions,  so  that  a  separation  does  not  appear  to  be 
possible  in  either  time  or  space.  Nonetheless,  a  “dy¬ 
namical  gap”  still  exists,  which  is  what  permits  one  to 
talk  separately  about  the  two  classes  of  motion,  and  one 
assumes  that  it  will  still  be  possible  to  paraimeterize  the 
eSects  of  the  waves  on  the  other  motions. 

Motions  other  than  internal  waves  (emd  even  smaller 
phenomena  such  as  those  associated  with  double  diffu¬ 
sive  processes)  may,  of  course,  have  a  continuous  spec¬ 


trum,  without  any  spectral  gap.  This  would  appear  to 
make  parameterization  of  small  scales  impossible.  The 
success  of  Large  Eddy  Simulations  (e.g.,  Metais  1998) 
has  shown,  however,  that  the  details  of  unresolved  scales 
may  be  imimportant  if  their  only  role  is  to  absorb  vari¬ 
ance  that  is  generated  at  larger  scales  and  cascades  to 
small  scales.  This  could  occur  in  three-dimensional  tur¬ 
bulence,  with  the  only  requirement  being  that  the  start 
of  the  inertial  subrange  be  resolved. 

In  the  ocean,  of  course,  as  in  the  atmosphere,  the  cas¬ 
cade  may  be  partly  to  larger,  rather  than  smaller,  scales 
so  that  the  details  of  the  small-scale  behavior  do  affect 
large  scales  after  a  finite  time.  This  is  the  classic  prob¬ 
lem  of  chaotic  behavior  and  lack  of  predictability.  On 
the  other  hand,  experience  in  the  atmosphere  suggests 
that,  while  “weather”  is  unpredictable,  “climate”  may 
be  predictable  (e.g..  Mote  and  O'Neill  2000).  Maybe 
we  can  hope  for  the  same  in  the  ocean. 

A  naive  view  might  thus  be  that  the  lack  of  a  spec¬ 
tral  gap  is  less  important  than  might  have  been  feared. 
Either  separation  is  possible  using  a  dynamical  gap,  or 
unresolved  scales  do  not  have  a  back  effect  on  resolved 
scales,  or  the  back  effect  does  not  affect  the  climatic 
state  of  the  ocean.  The  issue  is,  however,  one  that 
should  be  kept  under  review. 

3.  Are  Processes  in  Parallel  or  in 
Series?  What  is  Rate-Controlling? 

In  three-dimensional  turbulence  at  high  Reynolds 
number,  the  turbulent  scalar  flux  uC  is  in  parallel 
vdth  a  much  smaller  molecular  flux  -kVC,  with  k  the 
molecular  diffusivity.  The  turbulent  flux  does  depend 
on  the  presence  of  the  molecular  diffusivity  in  series;  as 
stressed  by  Nakamura  (1996)  and  Winters  and  D’Asaro 
(1996),  the  total  flux  may  be  thought  of  as  being  purely 
diffusive,  but  across  a  highly  convoluted  surface  of  con¬ 
stant  concentration.  The  value  of  the  molecular  dif¬ 
fusivity  does  not  determine  the  value  of  the  turbulent 
flux;  reducing  the  molecular  diffusivity  would  just  lead 
to  an  increase  in  the  streakiness,  or  fine-scale  gradients, 
of  the  scalar. 

Mathematically  this  can  be  summarized  from  the 
equation  for  the  rate  of  change  of  the  concentration 
variance 

dC^  (  _  _  _ V  _ 

-I-  V  ■  C'2  -I-  u'C'2  -  kVC"2  j  -f.  2u'C"  •  VC 

=  -2kVC'  •  VC'.  (5) 

This  is  just  the  Osbom-Cox  formula  (Osborn  and  Cox 
1972)  if  C  is  the  temperature.  If  the  time-dependent 
and  divergence  terms  on  the  left  hand  side  of  (5)  are 
neglected  on  the  grounds  of  stationarity  and  spatial  ho- 
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mogeneity,  there  is  a  balance  between  variance  produc¬ 
tion  and  dissipation.  The  turbulent  eddy  flux  may  re¬ 
main  the  same  when  k  is  reduced  provided  that  the 
mean  square  gradient  of  the  concentration  fluctuations 
is  increased. 

One  might  say  that  the  turbulent  and  molecular 
fluxes  act  in  parallel,  with  the  former  dominating,  but 
that  the  processes  also  act  in  series,  with  the  turbulent 
stirring  being  the  rate-controlling  process.  The  molecu¬ 
lar  diffusion  just  does  what  it  has  to;  it  is  essential  but 
does  not  control  the  flux. 

One  can  think  of  similar  situations  in  the  environ¬ 
ment.  For  example,  the  meridional  flux  of  potential 
temperature  in  the  atmosphere  is  controlled  by  the  eddy 
flux  associated  with  weather  systems.  The  eddy  flux  re¬ 
lies  on  the  presence  of  radiative  air  mass  modification 
in  series,  but  the  magnitude  of  the  flux  does  not  depend 
on  the  details  of  this. 

On  the  other  hand,  the  heat  flux  across  the  north 
wall  of  the  Gulf  Stream  may  appear  to  be  just  a  func¬ 
tion  of  the  rate  of  meander  formation  and  subsequent 
generation  of  warm  core  rings,  which  then  lose  their 
excess  heat  through  air-sea  interaction.  Many  of  the 
rings,  however,  recirculate  and  rejoin  the  Gulf  Stream 
before  they  have  fully  decayed  (e.g.,  Olson  1991).  Pre¬ 
sumably,  therefore,  the  northward  heat  flux  in  this  case 
depends,  at  least  partly,  on  the  actual  strength  of  the 
air-sea  interaction  which  acts  in  series  with  the  initial 
eddy  flux. 

4.  Triple  Decomposition 

When  there  is  some  small-scale  turbulence  acting  in 
series  with  larger  scale  eddies,  some  insight  may  be 
obtained  from  a  triple  decomposition  of  the  concen¬ 
tration  field  into  mean,  eddies  and  turbulence  {Joyce 
1977;  Davis  1994).  It  is  assumed  that  there  is  a  spec¬ 
tral  gap  between  the  mean  and  the  eddies,  and  another 
gap  between  the  eddies  and  the  turbulence.  We  write 
G  =  Cm  +  Ce  +  Ct,  with  subscripts  m,  e  and  t  denot¬ 
ing  mean,  eddies  an^turbulence,  respectively.  For  any 
quantity  Q  we  write  Q  for  the  average  of  Q  over  a  time, 
or  space  scale,  large  compared  with  the  scale  of  the  tur¬ 
bulence  but  short  compared  with  that  of  the  eddies,  and 
(Q)  for  the  average  of  Q  over  a  time  or  space  scale  long 
compared  with  that  of  the  eddies  but  short  compared 
with  that  of  the  mean  state. 

Assuming  stationarity,  so  that  variances  do  not  change 
with  time,  and  homogeneity,  so  that  divergence  terms 
may  be  ignored,  it  is  straightforward  to  derive 

■  VCm  +  (UeCe)  •  V(7„  =  (x)  (6) 


Eddies 


Turbulence  . .  ►  Dissipation 

Figure  1.  A  simple  schematic  of  the  path  of  scalar  con¬ 
centration  variance  from  mean  to  dissipation  via  eddies  and 
turbulence. 

{^t)  +  =  -\ ix)  (7) 

(UeCe)  •  VCm  “  (utC*  •  VCe)  =  0  (8) 

where  x  —  2kVC'  •VC''  is  the  rate  of  dissipation  of 
scalar  variance,  as  before. 

The  first  equation  here  is  just  the  Osbom-Cox  equa¬ 
tion  if  one  does  no  separate  averaging  over  the  time 
scales,  essentially  lumping  eddy  and  turbulent  fluctu¬ 
ations  together.  The  second  equation,  however,  shows 
that  the  dissipation  may  be  regarded  as  coming  from 
the  production  of  variance  by  the  turbulence  acting  on 
both  the  mean  state  and  the  fluctuations  produced  by 
the  eddies.  The  third  equation  is  just  the  difference 
of  the  previous  two  and  shows  that  variance  generated 
by  the  eddies  acting  on  the  mean  state  must  be  passed 
on  to  dissipation  by  the  turbulence  acting  on  the  eddy 
fluctuations.  The  situation  is  summarized  in  Figure  1, 
showing  the  pathway  from  production  to  dissipation  via 
the  eddies  as  well  as  directly  from  the  turbulence  acting 
on  the  mean  state. 

The  molecular,  turbulent  and  eddy  fluxes  are  all  in 
parallel  in  determining  the  total  flux  at  a  fixed  point, 
but  it  is  not  necessary  for  the  eddy  flux  to  dominate 
the  turbulent  flux  in  the  same  way  as  the  turbulent 
flux  dominates  the  molecular  flux  in  the  simple  case. 
Just  as  considered  by  Nakamura  (1996)  and  Winters 
and  DAsaro  (1996)  for  the  simple  case,  one  may  think 
of  the  flux  in  different  ways.  At  a  fixed  point,  or  across 
the  contours  of  the  overall  mean  scalar,  the  flux  is  made 
up  of  the  molecular,  turbulent  and  eddy  fluxes  all  act¬ 
ing  in  parallel.  Alternatively,  the  flux  is  purely  molecu¬ 
lar  across  the  contours  of  the  instaneous  concentration 
field.  A  third  way  of  viewing  the  problem,  however, 
is  with  respect  to  contours  of  C  obtained  by  averaging 
over  the  turbulence  but  not  the  eddies.  In  this  case  the 
flux  is  carried  by  the  turbulence  and  molecular  diffusion 
in  parallel.  If  the  contours  defined  this  way  are  not  very 
convoluted  it  implies  that  the  transport  across  a  fixed 
surface  is  dominated  by  the  turbulence  rather  than  by 
the  eddies. 

There  are  likely  to  be  variations  on  this  theme,  but 
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it  does  seem  that  a  triple  decomposition  may  provide 
insight  in  some  situations,  bearing  in  mind,  however, 
that  there  may  not  be  convenient  spectral  gaps  between 
the  mean,  eddies  and  turbulence. 

5.  The  Relationship  Between  Isopycnal 
Stirring  and  Diapycnal  Mixing? 

We  tend  to  think  of  adiabatic  processes  associated 
with  mesoscale  eddies  in  the  ocean  as  being  indepen¬ 
dent  of  small-scale  diabatic  mixing  processes  caused  by 
things  like  breaking  internal  waves.  It  is  important  to 
consider  whether  this  is  true,  and,  if  not,  what  implica¬ 
tions  it  might  have  for  the  parameterization  of  diapy¬ 
cnal  mixing.  Two  previously  studied  problems  will  be 
briefly  reviewed  here.  The  first  concerns  the  ultimate 
fate  of  the  available  potential  energy  released  into  eddies 
in  baro clinic  instability;  the  second  concerns  the  final 
dissipation  mechanism  for  thermohaline  fronts  that  are 
presumably  generated  by  adiabatic  stirring  on  isopycnal 
surfaces. 

5.1  Does  baro  clinic  instability  lead  to  diapycnal 
mixing? 

A  popular  parameterization  scheme  for  the  relax¬ 
ation  of  mean  isopycnals  by  baroclinic  instability  in  the 
ocean  was  proposed  by  Gent  and  McWUliams  (1990).  It 
represents  the  horizontal  components  of  the  skew  veloc¬ 
ity  by  iKVhb/bz)z,  where  b  is  the  buoyancy  field.  Gent 
et  al.  (1995)  showed  that  the  effect  of  this  is  close  to 
that  of  having  a  vertical  eddy  viscosity  ^  {P/N^)k 
acting  on  the  mean  flow.  The  appropriateness  of  this 
scheme  over  one  invoking  the  mixing  of  potential  vor- 
ticity  is  discussed  elsewhere  in  this  volume.  The  point 
to  be  made  here  is  that,  as  discussed  by  Tandon  and 
Garrett  (1996),  one  needs  to  consider  the  ultimate  fate 
of  the  mean  available  potential  energy  released  to  the 
eddies.  It  seems  unlikely  that  it  is  dissipated  adiabati- 
cally  by  internal  friction  in  the  ocean.  Some  of  the  eddy 
damping  may  occur  via  air~sea  interaction  or  by  viscous 
damping  in  the  boundary  layer  at  the  sea  floor,  but  one 
also  needs  to  consider  the  possibility  that  the  energy  is 
lost  in  the  fluid  interior  and  that  some  fraction  of  this 
appears,  through  diapycnal  mixing,  as  an  increase  of 
the  basic  potential  energy. 

As  discussed  by  Tandon  and  Garrett  (1996),  the  rate 
of  release  from  the  mean  flow  is  {P/N^)K\uhz\^  for  an 
equivalent  eddy  viscosity,  as  above,  acting  on  the  verti¬ 
cal  shear  Uhz  of  the  mean  horizontal  flow  u/i.  Using  the 
thermal  wind  equation  =  /”^|V/i6|  this  energy 
loss  rate  becomes  s^kN^  where  s  -  \Vhb\/N^  is  the 
mean  isopycnal  slope.  If  the  rate  of  creation  of  mean 
potential  energy  is  at  a  rate  Rf  times  this,  and  is  ex¬ 


pressed  as  then  the  vertical  eddy  diffusivity  is 

==  RfS^K  and  could  be  as  large  as  10'~^m^s'~^  in 
places  like  the  Southern  Ocean.  Tandon  and  Garrett 
(1996)  argue  that  this  could  have  a  significant  effect  on 
tracers,  though  it  is  unlikely  to  significantly  augment 
the  spin-down  of  the  mean  flow  already  associated  di¬ 
rectly  with  the  Gent  and  McWilliams  mechanism. 

The  problem  thus  seems  to  warrant  continued  inves¬ 
tigation.  Strong  internal  wave  activity  and  inferred  high 
dissipation  have  been  reported  for  the  Southern  Ocean 
by  Polzin  and  Firing  (1997).  They  suggest  that  the 
waves  originate  as  lee  waves  generated  by  strong  flows 
over  a  rough  sea  floor,  and  that  these  strong  flows  in¬ 
clude  currents  associated  with  the  mesoscale  eddies.  In 
that  case  the  internal  waves  would  indeed  be  associated 
with  the  decay  of  the  baroclinic  eddies,  though  the  mix¬ 
ing  in  the  water  column  would  not  be  associated  with 
the  local  current  shear .  It  is  worth  considering  whether 
there  could  additionally  be  some  direct  local  connec¬ 
tion  between  the  mesoscale  eddy  current  shear  and  the 
excitation  of  internal  waves. 

5.2  Are  thermohaline  fronts  dissipated 
passively  or  actively? 

Compensating  lateral  gradients  of  potential  temper¬ 
ature  and  salinity  can  exist  on  isopycnal  surfaces  in  the 
ocean.  Purely  adiabatic  stirring  of  these  gradients  will 
presumably  lead  to  density-compensated  thermohaline 
fronts.  Some  small-scale  mixing  mechanisms  must  oc¬ 
cur  to  prevent  the  temperature  and  salinity  gradients 
from  increasing  without  limit. 

The  gradients  could,  of  course,  become  sharp  enough 
that  molecular  diffusion  alone  removes  them.  Alterna¬ 
tively,  there  might  be  some  small-scale  lateral  mixing 
mechanism,  perhaps  involving  vortical  modes,  which 
would  hasten  the  transfer  to  molecular  scales.  Both 
of  these  possibilities  could  be  described  as  “passive”  in 
the  sense  that  the  small-scale  mixing  mechanism  was 
in  place  already,  and  just  copes  with  the  extra  variance 
produced  by  the  isopycnal  stirring. 

Another,  more  likely,  passive  mechanism  was  ex¬ 
plored  by  Haynes  and  Anglade  (1997)  and  is  reviewed 
by  Haynes  in  this  volume.  The  fundamental  idea  is 
that  the  vertical  shear  of  the  stirring  process  leads  to 
the  development  of  vertical  scalar  gradients  at  the  same 
time  as  lateral  gradients.  Sharp  gradients  develop  in  the 
vertical  as  well  as  laterally,  with  diapycnal  mixing  then 
acting  to  smooth  them.  Haynes  and  Anglade  (1997)  ar¬ 
gue  that  a  lateral  frontal  width  of  a  few  kilometres  is  a 
plausible  outcome  for  reasonable  values  of  the  vertical 
shear. 

An  alternative  mechanism  for  the  thermohaline  fron- 
tolysis  was  proposed  by  Garrett  (1982)  and  reviewed 
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by  Garrett  (1989).  He  suggested  that  as  the  front  de¬ 
velops,  it  will  be  unstable  to  thermohaline  intrusions 
which  would  hmit  the  further  narrowing  of  the  front, 
effectively  dissipating  the  lateral  variance  production 
by  larger-scale  stirring.  This  might  be  termed  an  “ac¬ 
tive”  process  in  that  it  introduces  extra  diapycnal  mix¬ 
ing  that  would  not  have  occurred  without  the  lateral 
stirring.  The  associated  diapycnal  eddy  diffusivity  is 
inevitably  negative  for  density  (as  is  the  case  for  den¬ 
sity  for  any  process  driven  double-diffiisively),  but  is 
likely  to  be  positive  for  salinity  and  C2in  take  either  sign 
for  temperature,  depending  on  the  circumstances.  The 
process  deserves  further  study  if  the  effective  diapycnal 
diffusivities  that  one  would  add  to  a  model  for  large- 
scale  oceanic  behavior  reach  significant  values.  If  the 
diffusivities  are  very  small  one  would  argue  that  the 
process  is  interesting  scientifically  but  just  does  what  it 
has  to  without  having  any  large-scale  impact. 

The  formula  suggested  for  the  effective  diapycnal  dif¬ 
fusivity  of  salt  is 

Ks  =  10-3z?2JV(5^|V;,5|/^■2)3  (9) 

where  D  is  the  diameter  of  the  mesoscale  eddies  doing 
the  stirring,  /?  the  density  coefficient  for  salinity  and 
iV/iiSI  is  the  lateral  gradient  of  the  large-scale  salinity 
field  (over  a  scale  larger  than  that  of  the  eddies  causing 
the  stirring).  The  value  of  Ks  is  clearly  very  sensitive 
to  the  lateral  salinity  gradient  and  only  likely  to  be 
significant  in  a  few  places  such  as  the  Mediterranean 
salt  tongue  where  it  is  large  {Garrett,  1989). 

The  model  also  predicts  the  thickness  of  the  intru¬ 
sions  to  be  about  \{9l3\VhS\/N^)D.  This  could  be 
compared  with  data  in  any  systematic  examination  of 
CTD  profiles.  Of  course  the  intrusions  would  only  be 
foimd  in  the  narrow  frontal  regions.  The  width  of  these 
was  predicted  to  be 

W  -  0.08(g/3lVhSl/N^)nNn-^  (10) 

where  Ct  is  the  large  scale  strain  rate  (say  10  “^s”^  or 
less  typically). 

Presumably  this  active  frontolytic  mechanism  only 
occurs  if  the  width  from  (10)  is  larger  than  the  width 
in  the  passive  mechanism  proposed  by  Haynes  and 
Anglade  (1997).  The  particular  values  for  W  tabu¬ 
lated  by  Garrett  (1989)  show  that  this  may  be  the  case, 
though  clearly  there  is  considerable  uncertainty  in  both 
models. 

One  location  for  which  information  is  available  is  the 
site  of  the  North  Atlantic  Tracer  Release  Experiment 
(Ledwell  et  al  1993).  In  this  experiment  an  artificial 
tracer  injected  at  a  depth  of  about  300  m  was  teased  out 
into  streaks  by  a  strain  rate  estimated  as  3  x  10~'^s~‘^ 
and  reached  a  width  of  about  3  km.  This  width  was 


found  by  Haynes  and  Anglade  (1997)  to  be  plausible 
for  the  mechanism  they  described,  though  with  consid¬ 
erable  uncertainty  associated  with  the  lack  of  informa¬ 
tion  on  the  vertical  shear  of  eddy  currents. 

To  apply  the  model  considered  here,  we  need  to 
assume  that  at  the  same  time  as  the  tracer  became 
streaky,  the  associated  lateral  convergence  would  lead 
to  thennohaline  frontogenesis.  The  large-scale  isopy- 
cnal  salinity  gradient  in  this  region  is  approximately 
10-'^m“\  and  -  1.8  x  10“®  so  that  9(3\VhS\/N^  - 
4  X  10  If  we  take  D  ~  100  km,  then  (10)  gives  a 
frontal  width  of  about  5  km.  This  is  also  close  to  the 
observed  width.  A  point  against  this  interpretation  is 
that  no  intrusive  features  were  reported,  though  Gar- 
retVs  (1982)  model  would  predict  them  to  be  only  2  m  in 
vertical  extent  with  very  small  temperature  and  salinity 
signatures  and  so  perhaps  not  readily  observable. 

It  also  seems  quite  likely  that  in  this  particular  sit¬ 
uation  neither  of  the  models  discussed  here  constituted 
the  ultimate  lateral  mixing  process,  but  that  the  mix¬ 
ing  was  actually  accomplished  by  small-scale  vortical 
modes.  The  discussion  is  perhaps  somewhat  academic 
anyway;  even  if  Garrett  ^s  (1982)  mechanism  were  the 
ultimate  frontolytic  process  in  the  NATRE  region,  the 
regional  average  diapycnal  mixing  rate  for  salt,  using 
(9),  would  be  less  than  10““®m^s~^  and  so  utterly  neg¬ 
ligible.  (The  smallness  of  this  value  is  partly  associ¬ 
ated  with  the  small  area  occupied  by  the  thermohaline 
fronts.) 

Perhaps  the  realistic  conclusion  at  this  stage  is  that 
we  do  not  know  in  general  what  the  frontolytic  mecha¬ 
nism  is,  that  it  may  vary  from  place  to  place,  and  that 
in  a  few  situations  it  may  be  important  to  know  what 
it  is  if  it  introduces  further  diapycnal  mixing. 

6.  Mixing  Efficiency 

An  important  issue  in  considering  the  relationship 
of  stirring  to  mixing  in  the  diapycnal  sense  is  that  of 
the  mixing  efficiency:  how  much  of  the  energy  being 
put  into  stirring  motions  is  converted  to  the  potential 
energy  of  the  basic  state?  The  topic  is  discussed  by 
Barry  elsewhere  in  these  proceedings  and  also  by  St 
Laurent  and  Garrett  (2001). 

The  standard  approach  is  to  write 

K,  =  Te/N^,  r  =  Rf/{1  -  Rf),  (11) 

where  Rf  is  the  flux  Richardson  number,  representing 
the  fraction  of  energy  put  into  the  stirring  motions  that 
produces  a  diapycnal  buoyancy  flux  rather  than  being 
dissipated. 

It  is  customary  to  choose  T  to  be  about  0.2,  based  on 
comparisons  of  different  methods  of  measuring  the  di- 
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apycnal  difFusivity  (e.g.,  Oofcey  1982),  but  there  seems 
to  be  no  reason  why  it  should  be  a  imiversal  constant. 
Ivey  and  Imberger  (1991)  argue  that  F  varies  consider¬ 
ably  in  a  two-dimensional  space  described  by  two  di¬ 
mensionless  parameters.  One  is  the  “overturn  Proude 
number”  Fry  =  (Lr/Lc)^/^,  where  Lr  = 
is  the  Ozmidov  scale  and  Lc  is  “the  scale  of  the  most 
energetic  overturns”.  The  other  parameter  is  the  “over¬ 
turn  Reynolds  number”  Rer  =  {LcILkY^^,  where 
Lk  =  (j/3/e)V4  is  the  Kolmogorov  scale. 

We  note,  however,  that  Frr  is  an  “intemaF’  dimen¬ 
sionless  parameter,  with  its  value  determined  by  the 
turbulent  mixing  itself  rather  than  by  the  physical  pro¬ 
cesses  driving  the  mixing.  It  thus  does  not  seem  to  be 
a  satisfactory  descriptor.  We  might  also  expect  that  T 
should  be  independent  of  Rex  for  large  values  of  the  lat¬ 
ter  or,  equivalently,  of  the  turbulent  Re5rnolds  number 
e/ =  Fr^Rer  when  this  is  large. 

Thus  one  still  seeks  “external”  dimensionless  param¬ 
eters  on  which  T  might  depend.  One  wonders  whether  T 
might  be  less  if  the  shear  is  persistent  (compared  with 
iV“^)  at  a  particular  location;  then  for  much  of  the 
time  the  turbulence  is  mixing  water  that  is  already  well 
mixed,  hence  with  less  buoyancy  flux  resulting.  This 
persistence  might  be  associated  with  the  frequency  con¬ 
tent  of  the  internal  wave  spectrum  (with  an  enhance¬ 
ment  of  low  frequency  energy,  such  as  that  at  inertial 
and  tidal  frequencies)  increasing  the  persistence  and  re¬ 
ducing  r.  Alternatively,  persistence  could  be  a  conse¬ 
quence  of  greater  exceedance  of  some  stability  criterion: 
if  the  Richardson  number  goes  well  below  1  /4  at  its  min¬ 
imum,  then  its  duration  below  1/4  is  increased  over  a 
case  in  which  it  just  barely  goes  below  1  /4. 

Perhaps  an  apparent  dependence  of  T  on  €/(i/N^) 
might  just  be  because  this  parameter  is  acting  as  a 
proxy  for  some  other  indicator  of  mixing  strength  that 
does  not  involve  the  viscosity  u. 

A  test  in  the  field  could  of  course  come  from  simul¬ 
taneous  data  on  e  and  the  temperature  microstructure 
dissipation  rate  x  if  both  were  available,  as  in  Oakey 
(1982).  If  only  velocity  microstructure  data  and  coarser 
density  data  were  available  one  could  proceed  via  a  com¬ 
parison  of  Thorpe  and  Ozmidov  scales.  The  former,  Lx^ 
is  the  r.m.s.  displacement  in  reordering  a  density  profile 
to  be  stable.  The  latter,  Lqj  is  defined  as  (e/JV^)^/^.  If 
we  write  Lq  =  ocLx^  then  =  Te/N^  implies  that 

Ky  =  Ta^NL'^,  (12) 

With  r  ci  0.2  and  a  ~  0.79  (Dillon  1982)  or  0.66  (Craw- 
ford  1986)  this  gives  Ky  ~  O.INL^.  This  latter  result 
seems  likely  to  be  a  consequence  of  the  dynamics  of 
overturning  and  more  likely  to  be  a  general  result  than 
a  fixed  proportionality  between  Lq  and  Lx*  Hence  if  a 


is  found  to  differ  from  0.8,  one  could  use  (12)  to  imply 
r  ~  0.1a~^.  Interestingly,  Perron  et  al  (1998)  found 
a  cri  0.95  in  the  energetic  abyssal  mixing  in  the  Ro- 
manche  Fracture  Zone.  As  they  point  out,  the  error 
bars  on  all  these  values  of  a  are  such  that  they  could  be 
consistent  with  a  universal  value,  but  we  note  that  the 
higher  value  they  find  might  also  imply  a  lower  mixing 
efficiency. 

7.  The  Probability  Distribution  of 
Thorpe  Displacements 

The  existence  of  statically  unstable  portions  of  a  wa¬ 
ter  column  is  clear  evidence  for  vertical  stirring  and 
mixing.  The  nature  of  this  stirring  is  usually  charac¬ 
terized  by  the  single  parameter  of  the  Thorpe  scale,  as 
discussed  above.  Recently,  however,  Stansfield  et  al 
(2001)  have  suggested  that  more  information  might  be 
available  from  the  actual  probability  distribution  Pi{L) 
of  the  Thorpe  displacement  L  (the  particle  displace¬ 
ment  in  the  reordering  process).  The  Thorpe  scale  Lx 
is  simply  (J^  L^Pi{L)dL)^^^  .  Now 

Pi(L)=  r  HPs(L/H)P2{H)dH/  rH^P2(H)dH 
do  Jo 

(13) 

where  P2  (H)  is  the  probability  distribution  of  overturns 
of  thickness  H  (an  overturn  being  a  closed  set  of  dis¬ 
placements)  and  P:i(L/H)  is  the  normalised  probability 
distribution  of  displacement  L. 

Stansfield  et  al  (2001)  do  not  offer  any  model  for 
P2(H)^  but  suggest  a  very  simple  kinematic  model  for 
Pz(L/H),  based  on  the  assumption  that  each  particle 
has  an  equal  probability  of  going  to  any  other  location 
in  the  overturn.  This  is  equivalent  to  the  assumption 
that  any  rearrangement  of  the  elements  that  make  up 
an  overturn  is  equally  likely.  If  the  overturn  is  made  up 
of  n  points,  then  we  may  take  L/H  =  m  for  0  <  m  <  n 
and  it  is  easy  to  show  that  P(0)  =  1/n  and  otherwise 
P(7n)  =  2(n  “  7n)ln^,  Then  =  17^/6  for  large  n. 

There  are  subtleties  associated  with  this  model  as 
it  is  necessary  to  exclude  the  rearrangements  that  do 
not  constitute  a  single  overturn,  but  these  are  not  crit¬ 
ical  if  n  is  larger  than  about  20  or  so.  Figure  2  shows 
Pz{L/H)  for  winter  and  summer  data  from  Juan  de 
Fuca  Strait.  While  the  error  bars  are  not  shown  here, 
there  are  clearly  more  small  displacements,  and  fewer 
medium  and  large  displacements,  in  the  data  than  in 
the  model.  There  is  Httle  reason  to  regard  the  model 
as  anything  other  than  a  convenient  reference,  but  it 
seems  possible  that  examination  of  Pz(L/H)  and  also 
P2(H)  in  different  settings  might  reveal  different  mixing 
regimes  and  possibly  provide  an  indicator  of  different 
mixing  efficiencies. 
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Figure  2.  The  probability  distribution  of  displacements, 
normalised  by  the  overturn  height,  from  winter  1998  (thick 
line  )  and  summer  1998  (thin  line)  data  from  Juan  de  Puca 
Strait.  The  dashed  line  is  the  prediction  of  a  simple  kine¬ 
matic  model,  assuming  50  points  in  a  profile  (though  this 
number  only  affects  the  value  of  L/H  at  which  the  model 
prediction  rises  to  from  1  to  2.)  Prom  Stansfield  et  al. 
(2001). 

Conclusions 

This  short  review  has  attempted  to  raise  some  of 
the  issues  of  stirring  and  mixing  in  the  ocean,  bearing 
in  mind  that  a  primary  goal  is  to  derive  parameteri- 
zations  of  small-scale  processes  that  have  a  significant 
impact  on  important  aspects  of  the  ocean’s  behavior. 
An  overriding  issue  is  the  need  for  formulae,  not  just 
numbers,  for  any  parameterization  in  a  model  that  as¬ 
pires  to  predictive  capability.  If,  in  a  model  of  a  partic¬ 
ular  resolution,  several  sub-grid-scale  processes  appear 
to  be  acting  together,  it  is  clearly  important  to  decide 
which  processes  are  rate-controlling  so  that  theoretical 
and  observational  programs  can  be  focused  on  them. 
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Abstract.  A  characteristic  property  of  geostrophic  turbulence  is  that 
energy  imdergoes  an  inverse  cascade  to  large  spatial  scales,  whereas  potential 
enstrophy  cascades  directly  to  small  spatial  scales.  In  the  presence  of  weak 
dissipation,  such  as  in  the  ocean,  energy  is  therefore  quasi-conserved  whereas 
potential  enstrophy  is  always  dissipated-  As  a  consequence  geostrophic  eddies 
only  partially  mix  potential  vorticity  along  isopycnals,  with  the  amount  of 
mixing  being  dependent  on  the  energy  available  in  the  initial  state.  To 
illustrate  these  ideas,  two  oceanographic  applications  are  considered:  the 
generation  of  abyssal  recirculations  around  topographic  features  and  the 
formation  of  inertial  zonal  jets.  A  new  eddy  parameterization  is  developed 
that  dissipates  potential  enstrophy  subject  to  the  constraint  of  conserving 
energy.  Results  obtained  using  the  parameterization  are  compared  with  those 
from  an  eddy-resolving  calculation.  Possible  extensions  of  these  ideas  are 
discussed,  including  the  possibility  of  incorporating  additional  constraints 
such  as  conservation  of  angular  momentum. 

Introduction 

The  ocean  contains  an  intense  geostrophic  eddy  field, 
with  a  dominant  energy-containing  scale  of  order  100 
km  at  midlatitudes.  In  order  to  obtain  plausible  turbu¬ 
lent  cascades  of  dynamic  and  passive  tracers  it  is  nec¬ 
essary  for  models  to  resolve  even  finer  spatial  scales  of 
order  10  km  (e.g.,  Roberts  and  Marshall^  1998),  and 
consequently  ocean  climate  models  are  unlikely  to  ad¬ 
equately  resolve  the  geostrophic  eddy  field  for  several 
years.  Development  of  parameterizations  of  unresolved 
geostrophic  turbulent  processes  therefore  remains  an 
important  task  for  ocean  modelling  and  climate  predic¬ 
tion.  But  at  least  as  importantly,  development  of  pa¬ 
rameterizations  is  also  an  excellent  strategy  for  testing 
and  further  advancing  our  imderstanding  of  geostrophic 
turbulence  and  its  impact  on  the  large-scale  circulation. 

The  performance  of  numerical  ocean  models  has  re¬ 
cently  been  transformed  by  the  widespread  adoption 
of  the  Gent  and  McWilliams  (1990)  parameterization 
(hereafter  GM).  The  key  idea  behind  their  parame¬ 
terization  is  that  geostrophic  eddies  adiabatically  re¬ 
arrange  fluid  parcels  along  isopycnals,  without  chang¬ 
ing  the  density  of  individual  fluid  parcels.  Conse¬ 
quently  GM  represents  the  eddy  flux  of  a  passive  tracer 
through  both  a  diffusion  along  isopycnals  and  an  ad- 
vection  by  an  “eddy-induced  transport  velocity”  ( Gent 


et  aL,  1995).  These  adiabatic  conservations  properties 
have  led  to  a  series  of  dramatic  improvements  in  ocean 
models,  including  improved  global  temperature  distri¬ 
bution,  improved  poleward  and  surface  heat  fluxes,  and 
improved  occurrence  of  deep  convection  {Danabasoglu 
et  ai,  1994). 

Nevertheless  there  remain  significant  shortcomings  of 
GM.  For  example,  the  eddy-induced  transport  in  GM 
is  chosen  to  mimic  the  effects  of  baxoclinic  instability 
by  extracting  potential  energy  from  the  mean  flow,  thus 
driving  the  ocean  towards  a  state  of  rest  with  flat  isopy¬ 
cnals.  In  contrast  a  number  of  studies  suggest  that  un¬ 
der  appropriate  conditions,  geostrophic  eddies  can  ho¬ 
mogenize  the  potential  vorticity  field  along  isopycnals 
{Holland  and  Rhines^  1980;  Rhines  and  Young,  1982; 
Marshall  et  ah,  1999).  These  results  have  led  to  alter¬ 
native  closures  for  the  eddy-induced  transport  in  terms 
of  isopycnic  gradients  of  potential  vorticity  {Treguier  et 
al.,  1997;  Killworth,  1997).  However  other  studies  sug¬ 
gest  that  while  eddies  may  mix  potential  vorticity  along 
isopycnals,  the  potential  vorticity  field  is  not  necessar¬ 
ily  homogenized  (e.g.,  Bretherton  and  Haidvogel,  1976; 
Wang  and  Vallis,  1994). 

In  this  paper  we  review  two  oceanographic  phenom¬ 
ena  in  which  eddies  mix  the  potential  vorticity  field 
along  isopycnals  without  making  the  potential  vorticity 
tmiform.  Firstly  we  consider  the  formation  of  anticy- 
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Figure  1.  The  inverted  shallow  water  model.  Motion  is 
confined  to  a  dynamically  active  abyssal  layer,  thickness  /i, 
overlying  variable  bottom  topography,  height  H.  The  height 
of  the  layer  inteface  is  7;  =  H  -h  h.  (Prom  Adcock  and  Mar¬ 
shall^  2000.) 


clonic  recirculations  around  topographic  features  (also 
known  as  “cold  domes”).  We  diagnose  the  nature  of 
the  turbulent  cascades  in  the  formation  of  such  recir¬ 
culations,  based  on  which  we  develop  a  new  energy- 
conserving  eddy  parameterization.  Secondly  we  con¬ 
sider  the  formation  of  inertial  zonal  jets  embedded 
within  a  larger-scale  flow.  Finally  we  discuss  some  pos¬ 
sible  extensions  and  wider  implications  of  our  results. 


Abyssal  recirculations 


In  this  section  we  review  the  results  of  some  numer¬ 
ical  experiments  illustrating  the  interactions  between 
geostrophic  eddies  and  the  mean  circulation  in  the  pres¬ 
ence  of  variable  bottom  topography  (Adcock  and  Mar¬ 
shall^  2000).  The  experiments  are  performed  using  an 
“inverted”  shallow- water  model,  in  which  motion  is  con¬ 
fined  to  a  dynamically  active  abyssal  layer,  thickness  h, 
overlying  variable  bottom  topography,  height  H  (Fig¬ 
ure  1).  While  highly  idealized,  this  model  captures  the 
following  key  ingredients:  a  geostrophic  eddy  field,  a 
Rossby  deformation  radius,  a  variable  isopycnal  inter¬ 
face,  and  variable  bottom  topography.  In  these  experi¬ 
ments,  we  simply  initialize  with  an  ensemble  of  balanced 
geostrophic  eddies  and  integrate  forwards  in  time  for  40 
model  years. 

The  equations  of  motion  can  be  written: 


ou  du  du  ^ 

dv  dv  dv 


-AV\,  (1) 
-AV\,  (2) 


dt 


(3) 


Here  r}  ^  h  +  H  \s  the  height  of  the  layer  interface, 
u,  V  are  the  velocity  components  in  the  x,  y  direc¬ 
tions  respectively,  /  =  /o  -f  is  the  Coriolis  parameter 
where  /o  =  0.7  x  10~^s“”i  and  /3  =  2  x  lO-^^^m^^s-S 
and  g*  =  0.02  ms“^  is  the  reduced  gravity.  We  solve 
in  a  square  domain  (dimension  1000  km)  containing  a 
seamount  of  height  500  m.  The  mean  value  of  rj  is  750  m. 
A  scale-selective  biharmonic  dissipation  is  employed  in 
the  momentum  equations  to  dissipate  preferentially  at 
the  grid  ‘Iscale;  in  our  reference  experiment  we  employ 
a  dissipation  coefficient  of  A  =  2.5  x  10®m^s"^.  The 
equations  are  discretized  on  a  C-grid  with  a  grid  spac¬ 
ing  of  5  km.  Further  details  can  be  found  in  Adcock  and 
Marshall  (2000). 

Figure  2  shows  the  evolution  of  the  interface  height 
(equivalent  to  streamlines  for  the  geostrophic  flow). 
The  initial  interface  height  field  reveals  a  series  of 
geostrophic  eddies  with  maximum  geostrophic  veloci¬ 
ties  of  order  SOcms"”^  After  4  months  there  is  the  first 
indication  of  the  interface  rising  over  the  seamount.  Af¬ 
ter  5  years  there  is  an  intense  “cold  dome”  over  the 
seamoimt  and  an  associated  anticyclonic  recirculation 
with  velocities  of  order  40cms“^  This  intense  recircu¬ 
lation  remains  even  after  40  years.  The  amplitude  of 
the  recirculation  is  dependent  on  the  amplitude  of  the 
initial  eddy  field.  Notice  that  at  all  stages  the  inter¬ 
face  height  field,  and  thus  also  the  energy  field,  remains 
trapped  at  scales  comparable  to,  or  larger  than,  the  de¬ 
formation  radius. 

In  Figure  3  we  show  the  equivalent  snapshots  of  the 
potential  vorticity  field, 


/  +  C 
h  ’ 


(4) 


where  C  =  dvjdx  —  dujdy  is  the  relative  vorticity.  At 
leading  order,  potential  vorticity  variations  are  dom¬ 
inated  by  the  variations  in  the  bottom  topography, 
with  high  values  of  potential  vorticity  found  over  the 
seamount  and  low  values  found  off  the  seamount.  Af¬ 
ter  4  months  there  is  clear  evidence  of  the  direct  cas¬ 
cade  of  the  potential  vorticity  field  towards  small  spatial 
scales  with  filaments  of  high  potential  vorticity  moving 
off  the  seamount  and  filaments  of  low  potential  vortic¬ 
ity  moving  onto  the  seamount.  After  5  and  40  years, 
the  maximum  value  of  the  potential  vorticity  is  signif¬ 
icantly  reduced  over  the  seamount,  but  note  that  the 
potential  vorticity  field  is  not  made  uniform.  The  re¬ 
duced  potential  vorticity  is  associated  with  increased 
layer  thickness  over  the  seamount,  and  hence  the  cold 
doming  and  the  anticyclonic  circulation.  In  contrast, 
a  imiform  potential  vorticity  state  would  require  a  far 
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Figure  2,  Interface  height,  r)  (shading  indicates  values 
greater  than  750  m).  (a)  Initial  conditions;  (b)  after  3 

months;  (c)  after  5  years;  (d)  after  40  years.  Contour  in- 
terval=:10m,  (From  Adcock  and  Marshall,  2000.) 


Figure  3.  Potential  vorticity  in  the  abyssal  layer,  (a)  Initial 
conditions;  (b)  after  3  months;  (c)  after  5  years;  (d)  after 
40  years.  Contour  interval=2  x  (From  Adcock 

and  Marshall,  2000.) 


greater  doming  of  the  layer  interface  over  the  seamount, 
which  is  impossible  without  an  input  of  energy. 

More  quantitatively  we  can  consider  the  evolution  of 
the  net  available  energy 

„  /•/• /ft(u2+t;2)  g'h{h+2H)'\  ^  , 

E  =  Po]j\j^  2  '^2 - (5) 

and  the  net  potential  enstrophy, 

^  =  (6) 

the  latter  being  a  measure  of  the  variance  of  the  po¬ 
tential  vorticity  field  and  thus  the  extent  to  which  the 
potential  vorticity  field  is  mixed.  Figure  4(a)  shows  the 
available  energy  over  the  first  five  model  years  for  differ¬ 
ent  values  of  biharmonic  momentum  dissipation.  The 
amount  of  energy  dissipated  increases  with  the  coeffi¬ 
cient  of  momentum  dissipation,  but  in  each  case  the 
energy  dissipated  is  small  compared  with  the  energy 
that  would  be  dissipated  if  the  eddies  were  spinning  the 
ocean  down  towards  a  state  of  rest.  Figure  4(b)  shows 
the  potential  enstrophy  over  the  same  period.  The  po¬ 
tential  enstrophy  rapidly  decreases  below  the  level  con¬ 
sistent  with  spin-down  towards  a  state  of  rest,  but  sim¬ 
ilarly  never  approaches  the  level  consistent  with  a  state 
of  uniform  potential  vorticity.  The  evolution  of  the  po¬ 
tential  enstrophy  is  remarkably  insensitive  to  the  level 


of  momentum  dissipation.  These  results  suggest  that 
eddies  are  neither  driving  the  model  ocean  down  to¬ 
wards  a  state  of  rest,  as  assumed  in  GM,  nor  towards  a 
state  of  uniform  potential  vorticity.  Instead  the  ocean  is 
driven  towards  a  state  of  minimum  potential  enstrophy, 
subject  to  the  constraint  of  (nearly)  conserving  the  net 
available  energy  (c.f.  Bretherton  and  Haidvogel,  1976). 


An  energy-conserving  parameterization 

The  above  results  can  be  exploited  to  develop  a  new 
geostrophic  eddy  parameterization.  As  in  GM  we  repre¬ 
sent  the  eddies  through  a  rearrangement  of  fluid  parcels 
or  "eddy-induced  transport”,  such  that  the  conti¬ 
nuity  equation  becomes 

^  +  V.(/iu  +  U*)  =  0.  (7) 

We  write  the  eddy-induced  transport  in  the  form 


U’  =  U-„a, 


(8) 


where 


fj  a.adxdy 

SI  dxdy 


(9) 


Here  C/q  is  taken  as  an  externally  prescribed  parame¬ 
ter  controlling  the  rate  at  which  rearrangements  occur, 
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Figure  4.  Evolution  of  (a)  the  total  available  energy  and 
(b)  the  total  available  potential  enstrophy  over  the  first  five 
years.  The  four  curves  correspond  to  different  coefficients  of 
biharmonic  momentum  dissipation.  The  thick  solid  line  in 
(a)  corresponds  to  the  energy  of  the  minimum  energy  state 
in  which  the  ocean  is  at  rest.  The  upper  thick  line  in  (b)  cor¬ 
responds  to  the  potential  enstrophy  of  the  minimum  energy 
state  and  the  lower  thick  line  corresponds  to  the  potential 
enstrophy  of  the  imiform  potential  vorticity  state.  (From 
Adcock  and  Marshall,  2000.) 


while  a  describes  the  spatial  pattern  of  these  rearrange¬ 
ments. 

We  now  seek  the  form  of  a  that,  in  a  given  time 
interval  At,  maximizes  the  dissipation  of  potential  en¬ 
strophy. 


-AA  =  JJ  DW^dxdy.  (10) 

subject  to  conserving  the  total  available  energy, 

AE  =  At  JJlJWBdxdy  =  0,  (11) 


where  B  =  g'r]  +  -f  v^)/2  is  the  Bernoulli  poten¬ 
tial.  Taking  variations,  Sa,  in  order  to  maximize  (10), 
subject  to  the  constraints  (9)  and  (11),  gives 


U*=«(v^  +  AVb).  (12) 

Here  « is  a  Lagrange  multiplier  determined  uniquely  by 
(9),  and  A  is  a  Lagrange  multiplier  determined  uniquely 
by  the  energy  constraint  (11). 

The  first  term  in  (12)  corresponds  to  a  potential  vor¬ 
ticity  closure  for  U*,  whereas  to  leading  order  the  sec¬ 
ond  term  corresponds  to  the  GM  closure.  The  new  pa¬ 
rameterization  thus  contains  elements  of  both  the  GM 
and  potential  vorticity  closures,  and  results  in  partial, 
but  not  total,  mixing  of  potential  vorticity. 

To  illustrate  the  use  of  the  parameterization,  in  Fig¬ 
ure  5  we  show  the  results  of  three  parallel  simulations, 
the  first  at  5  km  resolution  with  resolved  geostrophic 
turbulence,  the  second  at  40  km  resolution  and  includ¬ 
ing  the  parameterization  of  the  geostrophic  turbulence. 


and  the  third  at  40  km  resolution  with  neither  resolved 
nor  parameterized  turbulence.  The  integrations  are  ini¬ 
tialized  with  the  same  eddy  field  shown  in  Figure  2(a). 
In  the  high-resolution  case  (Figure  5a,  b)  an  intense  an- 
ticyclonic  recirculation  is  evident  after  30  model  weeks. 
The  coarse  resolution  case  with  the  parameterized  tur¬ 
bulence  (Figure  5c,  d)  is  able  to  reproduce  a  plausible 
anticyclonic  recirculation,  whereas  the  coarse  resolution 
case  without  the  parameterized  turbulence  (Figure  5e, 
f)  produces  only  a  very  weak  recirculation.  Note  that 
the  maximum  value  of  the  potential  vorticity  over  the 
seamount  is  much  higher  in  the  latter  case.  While  there 
are  some  differences,  particularly  in  the  details  of  the 
circulation  off  the  seamount,  we  are  encouraged  by  the 
ability  of  the  parameterization  to  reproduce  a  plausible 
doming  of  the  layer  interface  over  the  seamount. 


Zonal  jets 


The  formation  of  abyssal  recirculation  gyres  repre¬ 
sents  a  situation  in  which  potential  vorticity  mixing  is 
limited  by  the  amount  of  energy  available  in  the  ini¬ 
tial  state.  However  even  in  the  situation  that  there  is  a 
ready  supply  of  energy,  the  inverse  cascade  of  energy  to 
large  spatial  scales  can  still  restrict  the  extent  to  which 
potential  vorticity  can  be  mixed.  This  issue  is  a  topic  of 
ongoing  research,  but  here  we  present  some  preliminary 
results  that  highlight  some  of  the  issues  involved. 

Specifically  we  consider  eastward  flow  past  a  cylinder 
in  a  barotropic  ^-plane  charmel.  The  flow  is  maintained 
through  a  sponge  layer  (shaded  region  on  the  figures) 
within  which  the  zonal  velocity  is  rel2ixed  towards  a  uni¬ 
form  value.  These  experiments  were  originally 
to  study  the  dynamics  of  flow  separation;  further  details 
can  be  found  in  Tansley  and  Marshall  (2001),  Marshall 
and  Tansley  (2001).  Here  we  focus  on  just  one  aspect 
of  these  flows:  the  formation  of  zonal  jets. 

We  solve  the  nondimensional  barotropic  vorticity 
equation 

+  (13) 

where  ^  =  dv/ dx  —  du/dy  is  the  relative  vorticity. 


is  the  Reynolds  number,  and 


/3L^  _  ^ 

U  -LI 


(14) 


(15) 


is  a  non-dimensional  “^-parameter”  or  the  “Rhines 
number”;  the  latter  term  is  motivated  by  the  “Rhines 
scale”,  Lr  =  \/UIP,  that  controls  the  meridional  scale 
of  zonal  jets  in  geostrophic  turbulent  flows  {Rhines, 
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(a)  Re  =  25 


(b)  Re  =  200 


Figure  6.  Streamfunction  (nondimensionalized)  from 
three  solutions  with  0-75  and  (a)  Re  =  25,  (b)  Re  =  200 
(c)  Re  =  1000.  Contour  interval  =  0.2,  Shading  indicates 
the  sponge  layer  within  which  the  zonal  velocity  is  relaxed 
towards  a  uniform  value.  (From  Tansley  and  Marshall. 
2001.) 


(a)  Re  =  25 


Figure  5.  Snapshots  of  the  interface  height  (panels  a,  c,  e) 
and  potential  vorticity  (panels  b,  d,  f)  after  30  weeks  of  in¬ 
tegration.  Panels  a  and  b  correspond  to  the  5  km  resolution 
case  with  resolved  turbulence;  panels  c  and  d  correspond  to 
the  40  km  resolution  case  with  the  parameterization;  panels 
e  and  f  correspond  to  the  40  km  resolution  case  without  the 
parameterization.  Contour  intervals  as  in  Figures  2  and  3. 
(Prom  Adcock  and  Marshall,  2000.) 


(b)  Re  =  200 


1975).  Here  L  is  the  diameter  of  the  cylinder,  U  is 
the  mean  zonal  velocity,  v  is  the  coefficient  of  lateral 
viscosity,  and  /3  is  the  gradient  in  the  Coriolis  parame¬ 
ter. 

Here  we  show  solutions  only  for  the  value  0  =  75, 
and  for  three  values  of  the  Reynolds  number.  The  latter 
controls  the  level  of  dissipation,  and  hence  the  ampli¬ 
tude  of  the  eddy  field.  The  streamfunction  (^,  defined 
such  that  u  =  —d'^/dy,  v  =  d'll^/dx)  is  shown  in  Figure 
6  and  the  vorticity  (g  =  C  -f  0y)  is  shown  in  Figure  7. 

Even  at  the  lowest  Reynolds  number  of  25  (panels 


(c)  Re  =  1000 


Figure  7.  Vorticity  (nondimensionalized)  from  three 
solutions  with  4  =  75  and  (a)  Re  =  25,  (b)  Re  =  200,  (c) 
Re  =  1000.  Contour  interval  =  15.  (Prom  Tansley  and 
Marshall,  2001.) 
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a),  the  circulation  is  significantly  altered  from  the  non-* 
rotating  case  (not  shown).  Rather  than  a  separated 
region  downstream  of  the  cylinder,  we  find  a  separated 
region  upstream  of  the  cylinder.  This  is  related  to  the 
westward  propagation  of  Rossby  waves,  and  the  ability 
of  the  circulation  to  accommodate  western  boundary 
currents,  but  not  eastern  boundary  currents  (e.g.,  see 
Merkine,  1980;  Page  and  Johnson,  1990;  Tansley  and 
Marshall,  2001).  There  is  a  slight  hint  of  a  strongly 
damped  Rossby  wave  downstream  of  the  cylinder,  but 
no  turbulent  eddies. 

However  at  the  higher  Reynolds  numbers  of  200 
(panels  b)  and  1000  (panels  c),  the  circulation  is  funda¬ 
mentally  altered  downstream  of  the  cylinder.  At  these 
higher  Reynolds  numbers,  the  Rossby  waves  break  to 
form  a  turbulent  wake.  Within  this  wake  there  is  a 
direct  cascade  of  enstrophy  and  an  inverse  cascade  of 
energy;  the  latter  is  arrested  in  the  meridional  direc¬ 
tion  at  the  Rhines  scale  {Khines,  1975),  leading  to  the 
formation  of  a  series  of  zonal  jets.  These  jets  extend 
further  downstream  of  the  cylinder  at  the  higher  value 
of  the  Reynolds  number  since  the  turbulent  wake  itself 
extends  further  downstream. 

Just  as  in  the  formation  of  the  abyssal  recircula¬ 
tions,  the  vorticity  is  mixed  by  the  eddy  field,  but  it 
is  not  made  uniform.  Rather  there  are  “plateaus”  of 
nearly  uniform  vorticity,  separated  by  narrow  regions 
of  enhanced  vorticity  gradient.  Confirmation  that  the 
eddy  field  is  responsible  for  accelerating  the  zonal  jets  is 
given  in  Figure  8,  which  shows  the  convergence  of  the 
eddy  vorticity  flux  (-V.^u',  where  the  overbar  and 
primes  represent  time-mean  and  time-varying  compo¬ 
nents),  and  the  mean  dissipation  of  vorticity  through 
the  right-hand  side  of  (13).  While  the  eddy  term  con¬ 
tains  a  rich  structure,  there  is  an  unambiguous  tendency 
to  enhance  the  vorticity  gradient  across,  and  hence  ac¬ 
celerate,  the  zonal  jets.  In  contrast  the  dissipation  acts 
to  decelerate  the  jets. 

Discussion 

In  this  paper  we  have  considered  the  impact  of 
geostrophic  eddies  on  the  large-scale  circulation  from 
the  perspective  provided  by  geostrophic  turbulence  the¬ 
ory.  Energy  undergoes  an  inverse  cascade  to  large  spa¬ 
tial  scales,  whereas  potential  enstrophy  cascades  di¬ 
rectly  to  small  spatial  scales.  Consequently  we  argue 
that  geostrophic  eddies  only  partially  mix  potential  vor¬ 
ticity  along  isopycnals,  with  the  amount  of  mixing  being 
dependent  on  the  energy  available  in  the  initial  state. 

The  need  for  an  energetic  constraint  is  clear  in  the 
formation  of  abyssal  recirculations,  since  energy  is  re¬ 
quired  to  raise  isopycnals  over  a  seamount.  However  we 
suggest  that  an  energetic  constraint  might  also  be  ap- 


(a)  -V.C'u' 


Figure  8.  (a)  Convergence  of  the  eddy  vorticity  flux  and 
(b)  mean  frictional  dissipation  from  the  solution  with 
Re  =  1000  and  ^  =  75.  Two  contour  intervals  are 
employed:  1.0  between  values  of  -4.5  and  4.5;  and  10.0  for 
values  of  greater  magnitude;  shading  indicates  negative 
contours.  Only  the  central  portion  of  the  channel  is  shown. 
(From  Tansley  and  Marshall,  2001.) 


propriate  even  in  the  case  in  which  unconstrained  mix- 
ing  of  potential  vorticity  leads  to  a  net  decrease  in  en¬ 
ergy,  since  energy  remains  trapped  at  relatively  large 
spatial  scales.  In  Figure  9  we  speculate  on  the  impli¬ 
cations  for  the  slumping  of  a  baroclinic  front.  Accord¬ 
ing  to  GM,  the  eddy-induced  transport  acts  to  flatten 
the  isopycnals,  with  a  single  overturning  cell  extending 
across  the  baroclinic  front.  However  this  slumping  will 
result  in  a  release  of  potential  energy  which  must  be 
converted  to  kinetic  energy.  Alternatively  we  suggest 
that  the  the  eddy-induced  transport  may  initially  be 
confined  to  a  series  of  localized  overturning  cells  that 
act  to  convert  available  potential  energy  into  the  ki¬ 
netic  energy  of  zonal  jets.  This  kinetic  energy  will  in 
turn  be  dissipated  through  bottom  drag  and/or  internal 
wave  breaking,  allowing  further  slumping  to  proceed.  If 
this  hypothesis  is  correct,  then  the  rate  at  which  eddies 
transport  fluid,  and  tracers,  across  a  baroclinic  front 
should  be  dependent  on  the  nature  and  level  of  dis¬ 
sipation  in  an  eddy-resolving  calculation.  Work  is  in 
progress  to  explore  these  issues. 

There  is  also  a  distinct  possibility  that  there  are  addi¬ 
tional  constraints  on  the  mixing  of  potential  vorticity. 
For  example  in  a  zonal  domain  with  no  topographic 
variations,  one  should  expect  angular  momentum  to  be 
conserved  (e.g.,  Marshall,  1981),  It  is  also  unclear  how 
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Figure  9.  A  hypothesis  for  the  impact  on  an  energetic  con- 
straint  on  the  slumping  of  a  baroclinic  front.  We  suggest 
that  the  eddy-induced  transport  (U*)  may  be  confined  to  a 
series  of  localized  overturning  cells  that  act  to  convert  avail¬ 
able  potential  energy  into  the  kinetic  energy  of  zonal  jets. 
The  overall  meridional  transport  will  then  depend  crucially 
on  the  dissipation  of  the  jets  by  bottom  friction  and/or  in¬ 
ternal  wave  breaking. 

these  constraints  are  modified  in  the  presence  of  explicit 
forcing  and  dissipation,  and  indeed  whether  one  should 
take  into  account  the  input  of  energy  from  smaller,  un¬ 
resolved  spatial  scales  as  suggested  by  Holloway  (1987). 
There  is  also  the  possibility  of  energy  transfer  to  small 
spatial  scales  in  the  presence  of  small-scale  variations 
in  bottom  topography.  These  issues  all  require  further 
study. 

We  have  developed  a  simple  energy  conserving  eddy 
parameterization  in  this  paper.  In  principle  it  should 
be  possible  to  incorporate  additional  constraints  as  re¬ 
quired,  However  there  are  some  significant  limitations 
of  our  approach.  In  particular  we  apply  our  energy  con¬ 
straint  in  a  global,  rather  than  local,  sense.  The  alter¬ 
native  approaches  of  Sadoumay  and  Basedevant  (1985) 
and  Holloway  (1992)  may  provide  more  practical  ways 
forward. 
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Abstract.  Some  reqiurements  on  eddy  parameterisations  are  discussed, 
especially  the  implications  of  expressing  them  in  terms  of  the  quasi-Stokes 
velocity  and  the  modified  mean,  rather  than  Eulerian  mean,  density.  The 
difference  between  the  two  means  is  second-order  in  perturbation  amplitude 
and  thus  small  in  the  fiuid  interior  (where  formulae  to  connect  the  two  exist). 
Near  horizontal  boundaries,  the  differences  become  first  order,  and  so  more 
severe.  Existing  formulae  for  quasi-Stokes  velocities  and  streamfunction 
also  break  down  here.  The  layer  in  which  the  largest  differences  between 
the  two  mean  densities  occur  is  the  vertical  excursion  of  a  mean  isopycnal 
across  a  deformation  radius,  at  most  about  20  m  thick.  Most  climate  models 
would  have  difficulty  in  resolving  such  a  layer.  It  is  shown  that  extant 
parameterisations  appear  to  reproduce  the  Eulerian,  and  not  modified  mean, 
density  field  and  so  do  not  yield  a  narrow  layer  at  surface  and  floor  either. 
Both  these  featmes  rnake  the  quasi-Stokes  streamfunction  appear  to  be 
non-zero  right  up  to  rigid  boundaries,  so  that  we  must  query  what  are  the 
relevant  surface  and  floor  quasi-Stokes  streamfunction  conditions,  and  what 
are  their  effects  on  the  density  fields.  To  answer  this,  a  variety  of  eddy 
parameterisations  is  employed  for  a  channel  problem,  and  the  time-mean 
density  is  compared  with  that  firom  an  eddy-resolving  calculation.  The 
parameterisations  were  only  successful  if  the  vertical  component  of  the  quasi- 
Stokes  velocity  vanished  at  top  pd  bottom  as  in  current  practice,  but  all 
were  almost  equally  successful  given  proper  tuning.  One  parameterisation, 
based  on  linear  instability  theory,  is  extended  to  a  global  geometry.  In  low 
and  mid-latitudes,  because  the  predominant  orientation  of  the  instability 
wavevector  is  north-south,  the  main  quasi-Stokes  flow  is  east-west,  only 
becoming  the  more  traditional  north-south  at  higher  latitudes. 


1.  Introduction 

The  ocean  component  of  climate  models  is  necessar¬ 
ily  of  coarse  resolution,  and  it  does  not  possess  eddies. 
This  paper  discusses  various  aspects  of  the  issue  of  rep¬ 
resenting  eddies  in  such  models.  First,  it  discusses  the 
extant  parameterisations  (Section  2).  Section  3  exam¬ 
ines  some  hitherto  imappreciated  details,  with  reference 
to  the  ‘modified  mean  density’  concept;  it  is  shown  that 
the  difference  between  this  density  and  the  Eulerian 
mean  is  largest  at  surface  and  floor.  The  region  that 
this  difference  occupies  is  too  shallow  to  be  resolved  by 
coarse  models,  so  that  eddy  effects  occiu:  in  what  ap)- 
pears  to  such  models  as  a  delta-function.  Current  pa¬ 
rameterisations  do  not  generate  solutions  which  include 
these  differences,  even  with  adequate  resolution.  Some 
eddy-resolving  channel  experiments  are  used  in  Section 
5  to  explore  whether  the  relevant  boimdary  condition  is 
that  of  no  eddy  flux  (formally  correct  beyond  the  thin 
layer)  or  not;  the  conclusion  is  that  the  usual  boundary 


condition  must  be  employed.  Last,  a  parameterisation 
based  on  Hnear  instability  theory  (introduced  in  Section 
4)  is  extended  in  Section  6  firom  channel  simulations  to 
the  global  domain  and  is  briefly  compared  with  the  Gent 
and  McWilliams  (1990)  parameterisation.  The  direc¬ 
tionality  of  linear  instabihty  at  low  and  mid-latitudes 
is  such  that  the  ‘bolus’  fluxes  are  oriented  east-west, 
rather  than  north-south,  at  these  latitudes,  because  the 
beta  effect  constrains  maximal  instability  to  a  north- 
south  orientation  (which  yields  east- west  fluxes). 

2.  Background 

A  variety  of  schemes  has  been  suggested  to  include 
eddy  effects  in  coarse-resolution  ocean  models.  These 
schemes  divide  into  two  categories.  The  first,  which  we 
shall  be  examining  here,  involves  adding  terms  to  rep¬ 
resent  the  additional  thickness  flux  by  baroclinic  eddies 
{Gent  and  McWilliams,  1990;  Greatbatch  and  Lamb, 
1990;  Gent  et  al,  1995;  Visbeck  et  at,  1997;  Treguier 
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et  ai,  1997;  Killworth,  1997,  1998;  Greatbatch,  1998). 
The  second  (Neptune)  involves  a  representation  of  the 
statistical  properties  of  eddies  on  the  mean  flow  {Eby 
and  Holloway,  1994;  Merryfield  and  Holloway,  1997), 
and  will  not  be  discussed  in  detail  here.  Methods  to 
represent  propagating  features  (e.g.  Agulhas  rings)  do 
not  seem  yet  to  be  available. 

Eddy  parameterisations  have  been  designed  with  var¬ 
ious  criteria  in  mind  by  different  authors.  Initial  re¬ 
quirements  include  mimicking  both  baroclinic  instabil¬ 
ity  -  “reducing  APE”,  in  some  sense,  amd  barotropic 
instability  -  “reducing  lateral  shear”,  as  well  as  the  Nep¬ 
tune  approach  of  making  the  ocean  resemble  a  rectifled 
eddying  ocean.  All  these  are  quasi-steady  in  natme 
and  ignore  the  fact  that  instability  generates  variability 
on  weeks  to  decades  through  nonlinear  interactions.  I 
am  not  aware  of  any  parameterisation  which  attempts 
to  put  temporal  variability  into  the  ocean  (by  stochas¬ 
tic  forcing,  perhaps),  so  that  coupled  ocean-atmosphere 
models,  even  with  parameterisations,  are  unlikely  to 
possess  a  realistic  degree  of  self-induced  oceanic  vari¬ 
ability. 

Two  other  pragmatic  requirements  are  that  parame¬ 
terisations  should  not  induce  erroneous  ocean  behaviour 
(e.g.  breaking  conservation  laws  such  as  momentum), 
and  should  ensure  that  the  numerics  of  the  models  be¬ 
have  (the  original  concept  behind  eddy  diffusivity,  of 
course). 

The  manner  in  which  a  parameterisation  is  couched 
depends  on  what  belief  structure  about  eddy  behaviour 
is  used.  In  the  literature  already  are  suggestions  for 
thickness  smoothing,  potential  vorticity  {q)  conserva¬ 
tion,  energy  loss,  energy  conservation,  and  q  smoothing. 
These  effects  are  usually  placed  in  the  tracer  equations, 
but  it  is  possible  to  include  effects  in  momentum  equa¬ 
tions  also.  There  are  potential  difficulties  near  horizon¬ 
tal  surfaces,  discussed  later. 

2.1  Formulations 

Most  authors  seek  an  “equivalent  to  the  bolus  veloc¬ 
ity”,  namely  some 

(u"'', 

which  is  fully  three-dimensional  and  non-divergent,  such 
that  U-I-U+  advects  (a  form  of)  the  density  adiabatically 
-  i.e.  build  in  our  belief  that  density  (or  neutral  den¬ 
sity)  effects  are  adiabatic.  This  is  usually  represented 
as  a  2-dimensional  streamfunction 

^  =  (V’l.^z) 

such  that 

=  tpu,  =  ■02z,  =  -Vh  • 


The  most  logical  approach  to  date  is  the  transient- 
residual-mean  (TRM)  theory  introduced  by  McDougall 
(1998,  and  earlier  references  therein);  McDougall  and 
McIntosh  (2001,  hereafter  MM)  give  more  detail  on  the 
same  material.  Another,  highly  related,  approach  is  to 
use  density-weighted  averaging  (cf.  Greatbatch,  submit¬ 
ted  ms;  de  Szoeke  and  Bennett,  1993).  The  TRM  theory 
apphes  to  low-pass  temporally  averaged  quantities  and 
deduces  a  quasi-Stokes  velocity  u+  which  is  related,  but 
not  identical,  to  the  bolus  velocity.  (The  two  are  not 
identical  because  the  background  mean  flow  involves 
averages  on  two  different  surfaces,  though  they  are  fre¬ 
quently  similar.)  Formulae  have  been  derived  for 
perturbations  by  McDougall  (1998)  and  MM,  involving 
only  averages  at  constant  depth.  The  quasi-Stokes  vec¬ 
tor  streamfunction  is  given  to  second  order  in  amplitude 
by 


-  ^'hP'  ,  ^  \ 

pz  pz  \pz )  ^ 

where  the  suffix  H  denotes  the  horizontal  component, 
and  (j>  =  (l/2)p'2.  xhe  vertical  derivative  of  ^  is  the 
horizontal  component  of  u*^.  The  second  term  is  usually 
small  compared  with  the  first  and  is  neglected  hence¬ 
forth. 

Since  eddying  motions  are  believed  to  conserve  den¬ 
sity,  this  implies  that  the  definition  of  density  must  be 
modified.  McDougall  (1998)  shows  that  rather  than  us¬ 
ing  the  Eulerian  mean  density  p  at  a  (vertical)  point 
(EMD  for  short),  one  should  interpret  density  as  be¬ 
ing  the  inversion  of  the  mean  depth  of  a  given  density 
(termed  the  ‘modified  mean  density’  p,  or  MMD  for 
short).  The  difference  between  these  two  fields  p  and 
p  is  again  of  second  order  in  small  quantities  and  is 
thus  very  small  where  the  TRM  theory  is  formally  valid. 
However,  the  time  derivatives  of  EMD  and  MMD  differ 
by  0(1)  amounts  because  of  the  above  discussion.  The 
MMD  is  advected  by  the  (Eulerian)  mean  flow  and  by 
the  quasi-Stokes  velocity: 

Pt4- V*  [(u  +  u*^)^]  =  0. 

We  shall  see  that  near  horizontal  boundaries,  the 
small-amplitude  formulae  of  McDougall  (1998)  and  MM 
to  convert  EMD  to  MMD  break  down.  In  fact,  the 
two  fields  differ  at  first,  not  second,  order  in  the  small 
quantities.  (This  is  nothing  to  do  with  the  question  of 
neutrally  stable  and  mixed  layers,  which  are  beyond  the 
scope  of  this  paper.) 

The  earliest  parameterisation  was  a  simple  pair  of 
diffusivities:  +  ^vPzz-  This  suffers  from  the 

well-known  Veronis  effect,  in  that  fluid  within  a  density 
class  is  not  conserved.  The  now  classic  GM  parameter- 
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isation  (Gent  and  McWilliams,  1990)  takes 

Pz 

though  the  density  is  arguably  p  and  not  p.  This  both 
conserves  layer  thickness  (which,  note,  is  an  integral, 
not  a  conserved  quantity)  and  “smooths”  them.  It  fits 
conveniently  into  the  isopycnic  mixing  tensor  formal¬ 
ism  of  Solomon  (1981)  and  Redi  (1982).  Visheck  et  al 
(1997)  give  a  variant  with  horizontally  varying  diffusion 
coeflScient  based  on  baroclinic  instability.  Killworth 
(1997)  uses  an  approximate  solution  to  (single  wave) 
baroclinic  instability,  obtaining  a  three-dimensionally 
varying  diffusion  coefficient,  which  vanishes  for  stable 
flow.  The  scheme  mixes  q  in  one  direction^  not  down- 
gradient.  Thus 

cos^0  —  sin0cos0\ 
y  ““  sin  0  cos  6  sin^  6  ) 

and  6  is  the  angle  the  instability  wavevector  makes  with 
the  x-axis.  The  approach  is  usually  written  in  terms  of 
the  quasi-Stokes  streamfunction.  Many  other  sugges¬ 
tions  appear  in  the  literature,  mostly  imtested  {Mar¬ 
shall^  this  volume-MAYBE)  shows  a  different  approach, 
which  is  tested  in  simple  physical  situations). 

2.2  Inferences  from  eddy-resolving  simulations 

There  have  been  several  efforts  to  use  eddy-resolving 
computations  to  enlighten  choices  about  eddy  parame- 
terisation.  To  date,  these  have  produced  not  completely 
consistent  results. 

Lee  et  aL  (1997)  indicate  that  q  is  fluxed  in  a 
three-layer  channel  model,  and  not  layer  thickness. 
Roberts  and  Marshall  (2000),  however,  find  in  a  depth- 
co-ordinate  model  that  the  divergent  part  of  the  tem¬ 
perature  flux  is  not  well  correlated  with  mean  temper¬ 
ature  gradient;  the  equivalent  for  q  is  moderately  well 
correlated,  as  are  the  eddy-induced  transport  velocity. 
Wilson  (2000),  however,  in  a  three-layer  channel  model 
with  forcing  varying  downstream,  finds  varying  agree¬ 
ment  with  both  thickness  and  q  flux  (thickness  having 
better  agreement  than  q)  and  large  areas  in  the  middle 
layer  in  which  the  fluxes  are  not  correlated  with  either 
gradient,  suggesting  the  importance  of  the  rotational 
component.  This  latter  is  emphasised  by  Drijfhout  and 
Hazeleger  (2001)  in  a  gyxal  eddy-resolving  model;  they 
show  that  the  zonally  averaged  northward  mean  thick¬ 
ness  gradients  are  well  correlated  with  the  zonally  aver¬ 
aged  divergent  eddy  thickness  fluxes,  while  the  equiva¬ 
lent  for  q  is  much  less  well  correlated. 


Estimates  of  eddy-induced  diffusivities  also  vary. 
There  are  strong  suggestions  that  diffusivity  varies  ver¬ 
tically  {Treguier  1999;  Robbins  et  al.j  2000)  as  well  as 
theoretical  suggestions  of  lateral  variation  ( Visbeck  et 
ai.,  1997;  Killworth,  1997). 

The  boundary  conditions  on  surface  and  floor  of  the 
TRM  streamfunction,  which  should  apparently  be  zero 
values  in  both  locations,  also  remain  unclear.  Both 
Treguier  (1999)  and  Gille  and  Davis  (1999)  estimate 
the  streamfunction,  which  takes  extreme  values  at  both 
top  and  bottom  of  the  channels  they  considered. 

There  is  no  strong  evidence  that  any  single  glob¬ 
ally  tested  parameterisation  (a)  gives  similar  fluxes 
(rotational  and  divergent?)  to  observed  values  from 
eddy-permitting/resolving  models  or  (b)  uniformly  im¬ 
proves  water  masses  and  tracers.  GM  has  by  far  the 
largest  suite  of  tests,  and  while  the  evidence  is  clear 
that  -  mostly  -  temperature/salinity  distributions  are 
favourably  affected  by  its  use  (e.g.  Knutti  et  al,  2000 
for  a  2.5-dimensional  model),  the  response  of  other  trac¬ 
ers  deteriorates  {England  and  Rahmstorf,  1999). 

This  should  not  be  surprising:  physical  eddies  have 
many  effects,  and  the  extant  parameterisations  are 
aimed  at  a  small  subset  of  those  effects. 

3.  Some  details  about  TRM 
streamfunctions 

3.1  Differences  between  EMD  and  MMD 

The  TRM  approach  is  to  work  from  density  co¬ 
ordinates  to  locate  what  “density”  variable  is  conserved 
by  a  flow  consisting  of  a  mean  (u)  plus  a  quasi-Stokes 
velocity  (u"^).  McDougall  (1998)  shows  that  this  den¬ 
sity  (the  modified  mean  density)  is  the  inversion  of  the 
average  height  of  a  density  surface, 

^  p{x,y,z,t) 

for  some  averaging  operator.  Then 

+ V*  {(u  +  u‘^)/5}  =  0, 

MM  show  that  for  small  amplitude  variability,  and 
to  second  order  accuracy, 

p  =  p  +  p  =  p  +  Oia^) 


Pz  Pz  \Pz) 


where  the  second  term  is  usually  small  and  will  be  ne¬ 
glected  here. 
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These  formulae  work  well  except  near  the  surface  and 
floor, ^wheve  there  are  first-order  differences  between  p 
and  p.  These  differences  are  produced  by  the  advection 
of  fluid  laterally:  any  fluid  of  a  light  density  which  is 
ever  present  at  some  (x,  y)  has  an  entry  in  the  modi¬ 
fied  mean  density  The  differences  occur  over  a  distance 
which  is  first-order  in  perturbation  amplitude.  Fig.  1 
shows  the  near-surface  behaviour  for  a  flow  whose  den¬ 
sity  is  given  at  some  horizontal  location  by 

z  ~  zo  =  F{p  —  pq)  -f  aG{p  —  poj  t) 

G  =  0. 

(The  solution  is  for  G{t)  =  sint.)  The  behaviour  is 
caused  by  time-averaging  near  surface  or  floor  only  be¬ 
ing  valid  when  the  resulting  depth  z  is  within  the  fluid. 

The  length  scale  in  Fig.  1  is  proportional  to  eddy 
amplitude.  For  linear  theory,  it  appears  as  a  delta- 
function  boundary  layer.  When  the  eddies  have  finite 
amplitude,  the  vertical  length  scale  over  which  the  two 
densities  differ  noticeably  is  of  order 

z'  ~  p' Ipz  ^  a  \Vhp/Pz\ 

which  is  the  typical  vertical  excursion  made  when  mov¬ 
ing  a  short  horizontal  distance  (u)  along  a  mean  isopy- 
cnal  which  moves  significantly  vertically  only  on  the 
gyre  scale  {L  »  a).  This  depth  is  small  for  ocean, 
though  not  for  the  atmosphere.  Even  with  fairly  opti¬ 
mistic  estimates,  it  is  hard  to  produce  a  vertical  scale 
much  larger  than  20  m.  So  the  distance  over  which  the 
MMD  and  EMD  differ  significantly  is  not  resolved  in 
most  climate  models,  being  concentrated  in  the  last  grid 
point  (save  in  regions  such  as  the  Antarctic  Circumpolar 
Current).  Thus  the  near-boundary  differences  between 
the  two  mean  densities  will  probably  appear  to  climate 
models  as  single  grid-point  effects,  i.e.  delta  functions. 
McIntosh  and  McDougall  (1996)  show  diagnostics  from 
FRAM  which  support  this. 

No  parameterisation  of  which  I  am  aware  succeeds 
in  reproducing  the  MMD  even  given  adequate  vertical 
resolution.  Fig.  2  shows  channel  model  results  for  a 
4-year  and  along-channel  average  of  an  eddy-resolving 
calculation,  with  surface  relaxation  and  parameters  de¬ 
signed  to  permit  the  lighter  near-surface  and  denser 
near-bottom  fluid  present  in  the  MMD  to  be  resolved  by 
the  10-m  grid  spacing  (the  relevant  depth  being  50-60 
m).  Also  shown  is  a  two-dimensional  calculation  using 
GM  (other  parameterisations  will  give  similar  poor  re¬ 
sults)  .  It  is  clear  that  the  “pushing  forward”  of  warm 
isopycnals  present  in  the  MMD  is  not  present.  Dif¬ 
ferences  are  very  small  at  the  lower  boundary  because 
eddy  amplitudes  were  small  there  also.  The  presence  of 


(p  -  Po) 


Figure  1.  The  differences  between  Eulerian  mean  and  mod¬ 
ified  density.  The  upper  diagram  shows  that  the  densities 
are  very  close  to  each  other  in  the  fluid  interior  (differing  by 
0(a  ),  where  a  is  the  small  amplitude  of  the  fluctuations). 
In  a  zone  of  size  a  near  surface  and  floor,  the  two  densities 
differ  by  a  much  larger  amount,  0(a),  as  indicated  in  the 
exploded  lower  view  (which  is  actually  the  exact  solution 
for  sinusoidal  time  variation  and  imiform  interior  density 
greidient). 


a  mixed  layer  (not  treated  here)  makes  no  difference  to 
this  argument,  since  it  merely  moves  the  region  where 
EMD  and  MMD  differ  slightly  lower  (usually  to  worse 
resolution). 

Because  the  resolution  does  not  permit  the  resolution 
of  the  layer  where  MMD  and  EMD  differ,  not  all  the 
eddy  fluxes  can  be  represented.  Fig.  3  shows  the  the 
physical  near-surface  situation  contrasted  with  what  is 
repr^ented  in  a  coarse-resolution  model.  At  the  least 
density  which  the  model  is  capable  of  representing,  the 
quasi-Stokes  streamfunction  is  non-zero  (its  value,  cor¬ 
rectly,  representing  the  lateral  eddy  flux  within  the  un¬ 
resolved  layer,  which  thus  appears  as  a  delta-function). 

An  argument  can  be  made  that  one  could  relax  the 
condition  of  =  ^2  =  0  on  surface  and  floor  since  the 
model  being  run  (a)  cannot  distinguish  the  EMD  and 
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Eulerian  mean  temperature 


Modified  mean  temperature 


GM90  mean  temperature 


what  the  coarse  model  sees 


Figure  3.  Misrepresentation  of  quasi-Stokes  fluxes  when 
the  shallow  layer  is  not  resolved  by  the  model 

the  MMD  and  (b)  does  not  resolve  the  missing  la3?er. 
This  is  investigated  below. 

3.2  Mass  and  available  potential  energy 

The  differences  between  the  two  densities  have  two 
important  effects.  The  first  is  directly  concerned  with 
the  interpretation  of  mean  density.  It  is  straightforward 
to  see  that  the  low-pass  time  filtered  net  mass  in  a  water 
column,  which  is  a  uniquely  defined  value,  is  the  same 
whether  EMD  or  MMD  is  used: 


Figure  2.  The  Eulerian  and  modified  mean  density  for 
a  4-year  and  along-channel  average  of  an  eddy-permitting 
channel  model  discussed  in  the  text.  (The  average  over  the 
previous  4  year  period  is  almost  identical.)  The  problem  was 
chosen  to  provide  a  larger  vertical  range  over  which  the  EMD 
and  MMD  differ  than  would  hold  for  the  real  ocean,  so  that 
the  vertical  resolution  (10  m)  was  adequate.  Also  shown 
is  a  typical  two-dimensional  parameterisation  steady-state 
result,  in  this  case  following  Gent  and  McWilliams  (1990), 
using  an  eddy  diffusion  of  2000  m^  s“^  While  the  latter 
does  not  reproduce  the  EMD  particularly  well  (true  for  a 
wide  range  of  diffusivities),  it  does  not  reproduce  the  MMD 
at  all  where  this  differs  from  the  EMD.  This  appears  to  hold 
for  most  extant  parameterisations. 


/  =  J  pdz  (averaging  at  constant  depth) 

~  J  ~  J 

(averaging  at  constant  density). 


(As  Fig.  1  suggests,  p  is  lighter  at  the  surface,  but  the 
shortfall  is  made  up  at  the  floor.) 

The  same  does  not  hold  for  potential  energy,  because 
of  the  noncommutative  averaging  operators  on  products 
of  quantities.  For  small  amplitude,  the  differences  be¬ 
tween  EMD  and  MMD  potential  energies  are  O(a^), 
and  occur  due  to  0(q:^)  differences  in  the  interior  over 
a  depth  range  of  order  unity,  and  0(a)  differences  over 
0(a)  depth  ranges. 
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It  is  shown  in  Killworth  (2001)  that 

APjE  =  f  z{p  —  p)dz  <  0. 
J-H 


The  difference  between  the  two  PE  expressions  lies 
in  the  variability,  fundamentally  a  part  of  the  MMD.  It 
involves  an  integral  in  density  space  of  the  mean  square 
depth  fluctuations;  the  proof  is  straightforward,  either 
from  the  McDougall  (1998)  formulae  or  by  direct  eval¬ 
uation,  and  is  not  given  here. 

Thus  the  low-pass  filtered  potential  energy  of  a  fluid 
column  (a  uniquely  defined  quantity)  is  only  correctly 
evaluated  using  EMD,  and  is  consistently  underesti¬ 
mated  using  the  MMD;  correction  terms  can  be  de¬ 
rived,  and  involve  knowledge  of  the  variability.  This 
implies  that  energetics  cannot  consistently  be  produced 
for  coarse  models  including  parameterisations. 


4.  Creating  a  parameterisation 


The  Gent  and  McWilliams  (1990)  parameterisation 
has  the  virtue  of  simplicity  and  elegance,  although  to 
function  globally  various  modifications  have  to  be  made, 
typically  those  of  ‘tapering’  neutral  density  slopes  when 
they  become  too  large.  Other  parameterisations  are 
usually  less  elegant,  and  so  need  more  modifications. 
The  depth-co-ordinate  version  of  Killworth  (1997)  is 
briefly  discussed  here  before  comparisons  are  made  in 
channel  and  global  geometries. 

The  parameterisation  uses  linear  theory,  following 
Robinson  and  McWilliams  (1974).  Linear  theory  is  not 
always  a  good  predictor  of  the  behaviour  of  a  nonlinear 
eddying  system,  as  Edmon  et  al  1980)  demonstrate, 
but  it  makes  an  indicative  starting  point  (and  holds  in 
the  same  parameter  range  as  the  McDougall  (1998)  for¬ 
mulae).  The  system  is  assumed  to  be  slowly  varying 
in  the  horizontal,  compared  with  the  local  deformation 
radius.  The  equations  for  small  perturbations  varying 
as  exp  ik{x  cos  0  +  y  sin  0  —  ct)  are 


Qy  =  0cos6  —  (^“z)  (91°  direction  6) 

u  =  u  cos  0  +  u  sin  0  (velocity  in  direction  0) 

In  the  case  of  a  channel  geometry,  0  is  identically 
zero. 

All  quantities  can  be  expressed  in  terms  of  the  pres¬ 
sure  perturbation,  for  example: 


v'p' 


A:cos0 

2/po0 


Re(ipPz);  '<p2  =  - 


kcosO 

2/^ 


Re(ipp*) 


diffusivity  k  — 


kci 

p 

to 

'-s 

to 

oto 

u  —  c 

^  sin  0  Qy .  ^4-  _  ^  cos  0  qy 


Thus  to  estimate  all  that  is  needed  is  an  approxi¬ 
mate  solution  to  the  instability  problem.  The  deforma¬ 
tion  radius  is  estimated  from 

1  7° 

C  =  —  /  N{z)  dz^  a  =  C/  f 
J-H 


and  wavenumber  from  the  Eady  (1949)  result 
k  =  0.51/a. 


The  angle  0  is  estimated  from  a  crude  solution  max¬ 
imising  the  growth  rate  based  on  standard  deviations 
of  u  and  v  over  depth.  As  shown  by  Gill  et  al  (1974), 
0  is  small  for  east-west  flow  as  in  the  channel  case,  but 
is  near  7r/2  if  u  is  small  and  v  nonzero  except  in  high 
latitudes  where  /?  is  small.  The  approximate  vertical 
structure  of  «  and  p  is  found  by  two  iterations  of  an  apv- 
proximate  (small  wavenumber)  solution  of  the  problem, 
and  K  is  scaled  by  Aacj.  Here  A  is  an  0(1)  constant: 
3  is  found  to  be  optimal.  The  use  of  Ci,  the  imaginary 
part  of  the  phase  speed,  ensures  that  there  is  no  mixing 
where  flow  is  stable,  though  there  is  always  instability 
when  V  is  non-zero,  as  Pedlosky  (1987)  shows. 

The  resulting  streamfunction  possesses  two  terms. 
The  first  is  -k  times  the  slope  of  the  isopycnals  nor¬ 
mal  to  the  angle  of  instability,  which  has  similarities  to 
the  GM  formulation.  The  second,  extra  term,  has  no 
such  easy  interpretation.  Here  the  diffusivity  k  varies 
three-dimensionally. 


5.  Channel  model  comparisons 

The  discussion  in  Section  3  suggests  that  the  surface 
and  floor  conditions  on  the  TRM  in  coarse  resolution 
models  are  not  obvious  (Fig.  2  being  a  good  example). 
This  section  examines  solutions  to  two-dimensional  em¬ 
ulations  of  the  three-dimensional  channel  model  of  Kill- 
worth  (1998),  using  a  variety  of  formulations  to  repre¬ 
sent  the  eddy  terms,  specifically  to  examine  the  bound¬ 
ary  condition  question. 

The  model  covered  a  longitude  range  of  2.6°,  a  lat¬ 
itude  range  of  5.2°,  centred  on  30°N,  and  a  shallow 
depth  of  300  m.  The  grid  spacings  were  0.02°  east- west, 
0.018°  north-south  and  20  m  vertically,  with  viscosities 
50  m^  s^"^  (horizontal)  and  5  x  10*”^  m^  s'”  ^vertically 
and  diffusivities  10  m^  s“ ^horizontally  and  lO”'^  verti¬ 
cally.  Starting  from  a  narrow  temperature  front  with 
uniform  salinity,  relaxation  towards  the  initial  temper¬ 
ature  values  in  bands  at  north  and  south  of  the  chan¬ 
nel  provided  a  source  of  potential  energy.  This  method 
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has  the  advantage  that  there  are  no  regions  of  unstable 
or  neutral  stratification,  thus  avoiding  diflSculties  about 
parameterisations  in  such  regions.  Averages  were  com¬ 
puted  over  time  and  longitude  over  7.25  years  between 
days  300  and  2950. 

Two-dimensional  (latitude-depth)  simulations  were 
then  run  using  a  variety  of  two-dimensional  parameter¬ 
isations  on  a  Cartesian  gnd,  and  the  4000-day  compu¬ 
tations  (steady  in  almost  all  cases)  compared  with  the 
averages  from  the  three-dimensional  run.  Comparisons 
were  made  with  the  temperature  field  as  a  function  of  y 
(north)  and  2,  and  with  the  baroclinic  u  velocity.^  The 
comparisons  are  not  ideal.  Like  other  published  work, 
they  are  of  Eulerian  means  only,  and  over  a  period  prob¬ 
ably  an  order  of  magnitude  too  short  for  a  good  statis¬ 
tical  comparison.  (However,  the  fields  in  Fig.  3  were 
visually  imaltered  by  averaging  over  another  period  of 
similar  length,  so  the  statistics  may  be  better  than  we 
suggest.) 

Comparisons  were  made  both  visually  and  using  a 
stringent  measure  of  explained  variance  due  to  Visbeck 
et  al  (1997).  In  each  case,  any  free  parameters  in  the 
parameterisation  were  adjusted  to  maximise  the  agree¬ 
ment  with  the  eddy-resolving  average.  No  parameter¬ 
isation  reproduced  the  ‘pushing  forward’  of  isopycnals 
in  the  MMD,  so  that  direct  comparisons  with  it  are  not 
useful. 

Fig.  4  shows  the  three-dimensional  time-  and  along- 
channel-averaged  solution.  To  provide  a  yardstick  for 
the  various  parameterisations,  Fig.  5  shows  the  two- 
dimensional  temperature  field  using  only  advection  by 
the  actual  (two-dimensional)  velocity  fields  plus  a  hori¬ 
zontal  diffusivity  of  200  m^  which  clearly  gives  re¬ 
sults  very  close  to  the  three-dimensional  results. 

The  other  parameterisations  used  were  (in  order  of 
appearance  in  Figs.  6-10)  the  following: 

GM90  {Gent  and  McWilliams,  1990,  which  has  a  con¬ 
stant  diffusivity);  Fig.  6 

K97  (more  properly,  the  depth  co-ordinate  version  of 
Killworth,  1997,  discussed  earlier,  which  computes 
a  variable  diffusivity);  Fig.  7 

GMs  {Gent  and  McWilliams,  1990,  but  with  the  stream- 
function  non-zero  at  the  surface);  Fig.  8 

Ks  {Killworth  1997,  adapted  as  discussed  below);  Fig.9 

VP  (computing  {v'p%  directly  from  small-amplitude 
formulae,  also  discussed  below);  Fig.  10 

^As  discussed  by  Killworth  (1998),  the  two-dimensional  runs 
have  no  depth-averaged  u  field,  so  that  only  the  baroclinic  u  can 
be  compared.  The  barotropic  u  field,  as  noted  by  Killworth,  plays 
a  not  Inconsiderable  role  in  the  dynamics. 


Figure  4.  Contours  of  temperature  (°C;  contour  inter¬ 
val  0.5°  C)  for  the  time-  and  along-channel-averaged  three- 
dimensional  eddy-resolving  calculation. 


Figure  5.  Two-dimensional  simulation  of  Fig.  4,  using  a 
horizontal  difiusivity  of  200  m^  s~^ 


Latitude  (deg) 


Figure  6.  As  Fig.  4,  but  using  the  Gent  and  McWilliams 
(1990)  parameterisation,  with  k  =  160  s~^ 
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Latitude  (deg) 

Figure  7.  As  Fig.  4,  but  using  the  Killworth  (1997)  pa- 
rameterisation  with  A  — 3. 


Figure  8.  As  Fib.  4,  but  using  the  Gent  and  McWilliams 
parameterisation  modified  so  that  the  streamfunction  does 
not  vanish  at  surface  or  floor,  with  k  =  1200  m^  s“^ 


Figure  9.  As  Fig.  4,  but  using  the  Killworth  (1997)  pa¬ 
rameterisation,  modified  so  that  the  streamfunction  does  not 
vanish  at  surface  or  floor,  with  A  =  5 


Figure  10.  As  Fig.  4,  but  using  a  direct  parameterisation 
of  {v'p')y  directly  from  linear  theory,  with  A  =  3 


The  GM  parameterisation  is  handled  as  in  the  MOMS 
code  {Pacanowski  and  Griffies^  1999),  with  the  TRM 
streamfunction  dropping  to  zero  over  the  last  grid  point. 
The  diffosivity  k  is  taken  as  a  constant.  The  K97  pa¬ 
rameterisation  is  as  discussed  earlier.  The  third  param¬ 
eterisation,  GMs,  attempts  to  emulate  a  nonzero  value 
of  streamfunction  at  surface  and  floor.  This  is  not  an 
easy  task  numerically,  since  many  apparently  straight¬ 
forward  approaches  generated  numerical  instabilities. 
These  included  extrapolation  of  either  the  isopycnic 
slope  or  the  streamfunction  to  the  boundary,  and  com¬ 
putation  of  boimdary  values  using  one-sided  interpola¬ 
tion  formulae.  A  slightly  unsatisfactory  approach  which 
set  the  streamfunction  at  surface  (floor)  to  the  values 
immediately  below  (above)  was  eventually  used;  the  dis¬ 
advantage  being  that  the  field  vanished  in  the  top 
and  bottom  grid  points.  The  fourth  parameterisation, 
Ks,  does  the  same  thing  for  K97. 

The  last  parameterisation,  for  the  EMD,  directly 
evaluates  {v^p')y  (very  similar  answers  are  found  for 
+  (u;'p')z)  directly  from  small-amplitude  theory, 
again  using  KUlworth's  (1997)  scalings.  Neither  calcu¬ 
lation  requires  boundary  conditions  since  the  normal 
velocity  must  vanish  at  boundaries.  Direct  attempts 
to  parameterise  the  flux  divergence  usually  suffer  from 
Veronis  effects  (Veronis,  1975);  however,  this  approach 
does  not,  since  the  terms  have  the  same  conservation 
properties  (for  the  flux  terms)  as  the  original  system. 

The  most  accurate  version  of  the  GM90  parameter¬ 
isation  for  this  problem  has  a  «  of  160  m^  s“^,  a  little 
lower  than  that  cited  in  Killworth  (1998).  The  results 
for  the  GM90  (Fig.  6)  are  very  similar  to  those  of  pure 
diffusion  (5),  although  slightly  less  accurate  in  the  u 
field.  The  similarity  is  surprising  since  the  GM90  in¬ 
cludes  the  strong  northward  (southward)  advection  near 
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the  surface  (floor)  which  is  not  present  in  the  simple  dif¬ 
fusive  case. 

The  most  accurate  version  of  the  K97  parameterisa- 
tion  (Fig.  7)  has  .4  =  3,  as  used  in  Killworth  (1998)  for 
the  same  problem.  As  Fig.  7  shows,  this  paxameteri- 
sation  is  the  only  one  to  produce  the  ‘doming’  of  the 
15.5®  isotherm  near  the  northern  boundary  with  any 
accuracy. 

If  is  not  required  to  vanish  at  surface  and  floor, 
then  for  this  geometry  the  parameter  values  used  hith¬ 
erto  are  insufficient  to  reproduce  the  three-dimensional 
solution.  This  is  because  the  high  northward  advec- 
tion  neax-surface  is  now  lacking.  For  the  GM90  pa- 
rameterisation  (Fig.  8),  k  needed  to  be  increased  an 
order  of  magnitude  (to  1200  m^  s“^)  in  order  to  repro¬ 
duce  an  approximation  to  the  three-dimensional  fields. 
Although  the  temperature  field  looks  reasonable,  the 
corresponding  velocity  is  poorly  reproduced,  due  to  the 
strong  surface  front  near  the  southern  botmdaxy.  A  sim¬ 
ilar  finding  holds  for  the  K97  parameterisation  (Fig.  9). 
Thus  permitting  non-zero  w'^  at  surface  and  floor  has 
not  achieved  a  higher  accuracy  than  maintaining  zero 
for  this  problem  and  choice  of  parameterisations. 

The  final  parameterisation  (VP)  does  not  use  the 
(v“^,  ![;■*“)  formulation,  but  simply  inserts  a  parame¬ 
terisation  for  mixing  directly.  The  results  (Fig.  10) 
again  show  an  accurate  representation  of  the  three- 
dimensional  result. 

In  terms,  then,  of  reproducing  the  Eulerian  mean 
density,  most  schemes  were  successful.  The  K97  and 
VP  were  only  marginally  superior  to  the  others,  and 
schemes  which  permitted  nonzero  quasi-Stokes  stream- 
functions  at  the  surface  were  quite  inferior. 

6.  Global  parameterisations 

Although  various  theoretical  suggestions  for  param¬ 
eterisations  have  been  made,  relatively  few  have  been 
tested  in  global  models.  GM  has  been  shown  to  be  ro¬ 
bust  (provided  that  tapering  arrangements  are  included 
to  prevent  poor  behaviour  in  weakly  stratified  environ¬ 
ments).  The  K97  parameterisation  is  here  extended  to 
work  globally,  within  the  MOMS  code  (Pacanowski  and 
GriffieSf  1999).  This  represents  a  TRM  streamfunction 
as  an  extra  term  within  the  3  x  3  isopycnic  mixing 
tensor,  as 

(  Ai  0  AiSx  +  \  ( Px\ 

0  Ai  AiSy  +  V'2  1  Py 

\ AjSx  -  -01  AjSy  -  -02  AjS^-\-AdJ\PzJ 

where  5^,5^  are  the  isopycnal  slopes  in  the  x  and  y 
directions,  Aj  is  the  isopycnal  diffusion,  and  Ad  the 
diapycnal  diffusion. 


First,  there  is  also  an  element  of  tapering.  In  the  ex¬ 
pressions  for  01  and  02  (not  shown  here)  the  first  term, 
as  noted  earlier,  includes  an  expression  which  is  the 
product  of  the  diffusivity  and  the  isopycnal  slope  nor¬ 
mal  to  the  orientation  of  the  fastest  growing  wavenum¬ 
ber.  This  slope  is  tapered  in  precisely  the  same  way  as 
in  isopycnal  mixing.  Second,  because  the  K97  formula¬ 
tion  is  essentially  quadratic  in  shear  {k  increases  with 
shear,  and  operates  on  the  shear)  rather  than  hnear  as 
in  GM,  an  abrupt  start,  e.g.  from  observed  tempera¬ 
ture  and  salinity,  could  lead  to  instabilities.  The  diffu¬ 
sivity  -  and  hence  streamfunctions  -  within  a  column 
axe  rescaled  if  necessary  so  that  k  <  Kmax  =  lO^m^  8“^. 

The  role  of  baxoclinic  instability  near  the  equator 
is  unclear.  While  GM  does  not  depend  on  position, 
any  scheme  involving  q  mixing  must  make  choices  near 
equator  which  reflect  (a)  that  the  role  of  baxoclinic  in¬ 
stability  is  less  near  the  equator  than  at  mid-latitude 
and  (b)  that  it  is  the  east-west  density  gradient  which 
is  predominantly  creating  flows,  and  so  should  be  pref¬ 
erentially  decreased  by  eddies.  In  addition,  preUminaxy 
results  analysing  OCCAM  output  {de  Vries  and  Drijf- 
hout,  personal  communication)  suggest  that  the  TRM 
streamfunction  is  much  larger  near  the  equator  than  a 
simple  constant-diffusion  GM  would  predict. 

In  any  event,  an  engineering  ‘fix’  is  required.  At 
present,  the  Coriolis  parameter  /  is  taken  as  the  maxi¬ 
mum  of  /  and  some  fmin  (N.  Hemisphere  sign  conven¬ 
tion),  and  the  deformation  radius  is  defined  from  the 
mid-latitude  and  equatorial  values  by 

o^mid  =  C/  |/|  ;  a^q  =  \/(C/2/3); 

a  =  TIX\li{amid-)  ^eg)?  ^  “  0.51/ci7nid 

which  both  avoids  large  wavenumbers  and  permits  the 
approximate  solution  to  continue  to  function  reason¬ 
ably. 

The  last  modification  is  that  velocity  shears  are  con¬ 
sistently  used  in  the  calculation  instead  of  density  gra¬ 
dients,  since  thermal  wind  fails  near  the  Equator.  Near 
the  surface  these  include  the  Ekman  contribution.  This 
is  removed  in  the  surface  layer  (the  model  has  no  mixed 
layer)  by  extrapolating  the  effective  surface  velocity 
from  the  next  two  depths. 

The  above  changes  have  been  implemented  in  MOM3, 
and  run  in  a  2®  X  2®  near-global  configuration  (77®S  - 
77®N)  for  one  year  only,  since  various  features  of  the 
paxameterisation  remain  under  experiment.  Monthly 
windstress  and  surface  forcing  were  employed  using 
standard  MOM3  options.  Typical  parameter  values 
were  used,  including  an  isopycnal  diffusivity  of  10^ 
m^  s""^.  A  parallel  rim  used  the  GM  formulation,  which 
is  an  option  in  MOM3,  with  n  =  10^  m^  s“^.  This  failed 
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after  3  months  with  erroneously  high  velocities  in  the 
Arctic.  Results  from  the  last  snapshot  files  are  shown 
from  both  runs.  Normal  diagnostics  such  as  overturn¬ 
ing  streamfunction  are  not  useful  with  such  short  runs, 
and  for  reasons  of  space  only  one  elementary  diagnostic 
is  given  here. 

Figures  11  and  12  show  the  near-surface  (level  1)  hor¬ 
izontal  TRM  velocities  superimposed  on  temperature 
contours  for  K97  and  GM  at  the  second  level  (depth 
37.5  m)  of  the  model,  at  the  respective  ends  of  their 
runs.  Interestingly,  neither  set  of  TRM  velocities  are 
particularly  small  compared  with  the  mean  fiow.  The 
K97  TRM  flow  is  predominantly  east-west  at  latitudes 
less  than  about  40®.  This  is  caused  by  the  north-south 
orientation  of  the  fastest  growth  rate,  at  least  with  the 
approximation  used  here.  Another  approximation,  cast¬ 
ing  the  continuous  flow  onto  a  2-layer  system  and  max¬ 
imising  growth  rate  over  wavenumber  and  angle,  fol¬ 
lowed  by  using  this  in  the  vertical  iteration  scheme  used 
normally  in  K97,  gives  similar  answers.  It  is  possible 
that  both  approximations  miss  other,  stronger  growth 
rates  at  higher  wavenumber.  Gill  et  al  (1974)  give 
some  examples,  but  the  indication  from  their  work  and 
from  Pedlosky^s  (1987)  book  is  that  the  orientation  is 
expected  to  be  close  to  north-south  under  most  circum¬ 
stances,  as  Fig.  11  suggests.  Thus  the  restriction  of  the 
TRM  flow  to  represent  q  mixing  along  a  single  orientar 
tion  only  yields  TRM  flows  which  are  in  the  east- west 
direction  in  midlatitudes  (despite  the  fact  that  density 
gradients  are  for  the  most  part  much  stronger  north- 
south  than  east- west  in  ocean  data).  Only  in  the  subpo¬ 
lar  areas  is  the  ^-effect  reduced  stifiiciently  that  north- 
south  TRM  flows  appear,  e.g.  in  the  Antarctic  Circum¬ 
polar  Current.  However,  the  equatorial  TRM  flows  in 
Fig.  11  are  smooth  and  well-behaved,  in  contrast  to 
the  GM  flows  in  Fig.  12  which  are  rather  noisy.  In  the 
subtropics,  both  K97  and  GM  have  weak  TRM  flows. 
However,  in  the  subpolar  region  the  GM  TRM  flows  are 
more  uniform  and  slightly  stronger  poleward  than  their 
K97  counterparts. 

7.  Discussion 

The  eddy  parameterisation  issue  is  far  from  resolved 
for  many  reasons.  Even  if  the  belief  structure  is  adopted 
that  mixing  is  done  by  eddies,  it  is  unclear  what  should 
be  mixed.  The  role  of  divergent  and  rotational  fluxes 
remains  unclear.  Coarse  models  do  not  usually  possess 
the  vertical  resolution  to  distinguish  Eulerian  and  MM 
density  near  the  surface  and  floor,  save  in  weakly  strat¬ 
ified  regions. 

Nonetheless,  simple,  clean  parameterisations  such  as 
GM  seem  to  improve  much  (but  not  all)  of  a  coarse 
model  response.  There  are  indications  that  mixing  co- 
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Figure  11.  Global  K97  results  for  near-surface  temperature 
and  horizontal  TRM  fluxes,  after  1  year’s  integration 
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Figure  12.  GM  equivalent,  after  3  months’  integration 
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efficients  vary  spatially,  but  it  is  unclear  whether  this  is 
0(1)  in  importance;  recall  that  the  channel  simulations 
showed  almost  indistinguishable  results  between  GM, 
with  a  constant  difFusivity,  and  K97,  where  the  difFu- 
sivity  varied  by  two  orders  of  magnitude.  Similarly, 
more  complicated  parameterisation  schemes  appear  to 
function  as  well  as  (sometimes  better,  sometimes  not) 
simple  schemes.  Again  it  is  unclear  how  important  the 
simple-complex  distinction  is.  Certainly  it  is  not  proven 
that  any  complicated  scheme  performs  better  globally 
than  a  simple  one. 

In  the  near  future,  careful  intercomparisons  of  differ¬ 
ent  schemes  against  trusted,  statistically  reliable  three- 
dimensional  eddy-resolving  calculations  will  be  required. 
This  needs  the  community  to  invest  in  fine-resolution 
closed-gyre  forced  and  free  calculations  run  to  some 
form  of  equilibrium  to  use  as  the  testbeds  for  param- 
eterisations.  Some  of  these  need  to  be  explicitly  time- 
G.g.  on  seasonal  scales.  (Should  seasonal  varia¬ 
tion  be  included  within,  or  without,  the  averaging  inter¬ 
val  for  eddy  fluxes  when  applied,  say,  to  annual  mean 
coarse  resolution  modelling?)  The  community  should 
also  be  encouraged  to  seek  other  ways  of  casting  the 
parameterisation  problem  (e.g.,  the  approach  by  Mar¬ 
shall  in  this  volume). 

Acknowledgments.  My  thanks  to  George  Nurser  for 
parts  of  Fig.  2. 
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Are  there  principles  to  guide  eddy  parameterizations? 

Greg  Holloway 

Institute  of  Ocean  Sciences,  Sidney,  B.C.,  Canada 


Abstract.  A  “list  as  long  as  your  arm”  has  described  the  range  of  principles  that 
may  he  attempted  for  possible  eddy  parameterization  schemes.  We  feel  uncom¬ 
fortably  an  attitude  of  “try  and  see”  whether  any  particular  principle  “works”  in 
any  particular  application.  This  note  summarises  a  discussion  which  followed 
the  body  of  presentations  at  ‘Aha.  We  ask  if  principles  from  general  physics, 
especially  notions  of  the  2"^*  Law  and  entropy,  can  help  clear  a  way.  We  ask  if 
such  ideas  offer  practical  means  to  advance  practical  knowledge,  and  where  ma¬ 
jor  impediments  may  lie. 


A  discussion 

These  pages  follow  notes  taken  throughout  ‘Aha 
Huliko’a,  and  from  ideas  discussed  in  part  during  the 
‘Aha  discussion  time.  The  issues  were  framed  by 
David  Marshall  and  further  by  Peter  Killworth.  Sur¬ 
veying  approaches  that  have  been  taken  to  providing 
a  basis  for  eddy  parameterization,  one  is  daunted  by 
the  length  of  the  list  and  the  tentative  character  of  the 
entries.  What  “should”  eddies  do?  On  the  list  we 
find  (as  examples) 

Eddies  should  flatten  isopycnals. 

Eddies  should  maximally  dissipate  APE  (available 
potential  energy). 

Eddies  should  mix  PV  (potential  vorticity). 

Eddies  should  mix  layers  thickness  along  layers. 

Eddies  should  maximally  dissipate  enstrophy. 

Eddies  should  reduce  MKE  (mean  kinetic  energy). 

Eddies  should  relax  toward  certain  rectified  (“nep- 
tune”)  flows. 

...  and  so  on. 

“Should”  eddies  do  any  of  this?  While  we  pose 
this  list  in  terms  of  eddy  parameterizations,  another 
‘Aha  topic — stratified  mixing — would  generate  yet 
another  list.  Are  we  dispirited?  To  “help,”  Bill 
Young  inserted  an  estimate  of  the  number  of  active 
degrees  of  freedom  in  the  ocean,  suggesting  10‘*  “per 
cell”  or  10^’  if  one  includes  biology.  (There  was  a 
little  dispute  about  numbers,  but  the  key  message  is 
the  numbers  are  “big” — ^far,  far  bigger  than  modem 
supercomputers  are  able  to  prognose,  which  are  more 
like  lO’  tolO’.)  Given  such  circumstances,  Walter 
Munk  asked  if  we  deem  the  situation  hopeless. 


Answering  Walter’s  question  depends  upon  what 
ocean  modelling  seeks  to  do.  If  our  modelling  pro¬ 
ject  amounts  to  trying  to  invent  a  steam  engine  from 
molecular  dynamics  simulation  of  water  vapour,  it 
may  well  be  hopeless  in  our  lifetimes.  If  we  would 
invent  a  steam  engine  based  on  thermodynamic  func¬ 
tions,  in  part  from  empiricism  and  in  part  from  statis¬ 
tical  physics,  it  may  not  be  so  hopeless.  Importantly 
we  need  to  turn  the  huge  number  of  degrees  of  free¬ 
dom  from  threat  to  opportunity. 

When  we  recognize  that  we’ve  no  ability  nor  prac¬ 
tical  interest  to  know  the  ocean  in  all  its  10^^  (or 
whatever)  details,  we  naturally  turn  to  probabilities  of 
oceans.  Mel  Briscoe  asked  if  we  would  predict  evo¬ 
lution  of  probabilities  distributions  or  if  we  limit  at¬ 
tention  to  moments  (expectations)  from  those  prob¬ 
abilities.  Framing  issues  in  terms  of  moments  might 
render  the  task  manageable? 

Lessons  in  our  coffee  cups? 

A  difficulty  may  be  in  part  “cultural”  insofar  as 
we,  as  a  community,  have  little  orientation  toward 
statistical  physics,  basing  ocean  dynamics  instead  on 
the  classical  mechanics  of  GFD  amended  with  sundry 
by-guess-and-by-golly  mixing  coefficients.  During 
‘Aha  two  other  themes  recurred.  We  were  reminded 
of  the  oft-cited  stirring  cream  into  coffee.  And  we 
were  reminded  of  the  influence  Carl  Eckart  brought, 
seeking  to  base  physical  oceanography  upon  underly¬ 
ing  physics.  Although  we  cannot  invite  Carl’s  direct 
input,  we  might  seek  in  a  spirit  after  Carl  to  ask  why 
does  cream  in  coffee  turn  brown.  Here  I  only  substi¬ 
tute  my  own  comment.  I  should  hope  the  answer  is 
not  because  stirring  causes  enhanced  mixing  (diffu- 
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sion).  I  should  hope  the  answer  is  that  internal  inter¬ 
actions  within  the  coffee  cup  transition  the  probabili¬ 
ties  of  cream  and  of  coffee  to  a  distribution  with 
higher  entropy.  Practically,  the  useful  representation 
of  this  idea  may  well  be  that  stirring  leads  to  en¬ 
hanced  mixing.  We  see  this  as  a  result  following 
from  the  underlying  basis,  after  which  we  might 
quasi-empirically  parameterize  cups,  teaspoons, 
manners  of  agitation,  etc. 

In  the  case  of  stirring  cream  into  coffee,  one  is 
quite  inclined  merely  to  nod  to  the  entropy  discussion 
before  proceeding  directly  to  parameterizing  the  stir¬ 
ring-mixing.  Were  the  topic  of  ocean  eddies  this 
simple,  we  would  hardly  speak  of  so  simple  a  matter 
at  ‘Aha  Huliko’a.  But  eddies  are  not  simple.  Then 
stirring-mixing  intuitions,  post-hoc  modified  by  crite¬ 
ria  such  as  listed  at  the  outset  of  this  note,  are  notori¬ 
ously  unreliable.  Was  there  something  more  to  learn 
in  our  coffee  cups? 

Dynamics  of  moments  of  probable  seas 

The  following  story  is  not  yet  clear,  in  part  because 
the  methods  are  so  little  explored.  Here  I  make  a 
sketch,  indicating  some  research  directions,  practical 
results,  relations  to  issues  David  and  Peter  framed, 
and  current  challenges. 

First  we  embrace  the  idea  that  oceans  are  known 
only  in  probability.  The  detailed  state  of  the  ocean 
might  be  expressed  in  a  state  vector  y,  whose  dimen¬ 
sion  could  well  be  10^’,  or  whatever.  We  don’t  know 
y.  We  only  speak  of  elemental  probability  dP  = 
P{y)y  that  the  actual  value  of  y  falls  within  a  phase 
volume  dy  about  any  given  y.  So  far  the  discussion  is 
aerie-faerie.  What  we  really  would  like  to  access  are 
moments  of  P,  such  as  Y  =  y  dP  with  the  integral 
over  all  y.  Importantly,  the  dimension  of  Y  need  no 
longer  be  10^^.  We  can  “project  out”  as  much  of  y  as 
we  care  not  to  consider,  perhaps  taking  as  only 
“lumped”  (space-time  averages)  over  y.  (My  nota¬ 
tion  could  be  embellished!)  Dimensions  of  Y  might 
be  only  lO’  or  10^  or  maybe  only  10.  The  key  con¬ 
sideration  is  that  these  Y  are  moments  of  probabili¬ 
ties.  That  distinction  can  get  lost  when,  for  example, 
we  look  at  output  of  GCMs  and  see  maps  of  veloci¬ 
ties,  temperatures,  elevations  or  whatever.  Even 
when  we  admit  that  the  Y  are  grid-cell-averaged 
variables,  we  tend  still  to  speak  of  “velocity”  rather 
than  “velocity  moment  of  probability  of,”  for  exam¬ 
ple.  So  what? 


The  “so  what”  gets  us  when  we  write  dynamic 
equations.  Too  easily  we  look  up  equations  for  ve¬ 
locity  or  temperature  or  such  from  textbooks,  the 
only  ambiguity  arising  from  nonlinearities  which, 
averaged  over  space-time  volumes  need  some  closure 
“approximations”.  Were  we  to  ask  instead  for  the 
equation  of  motion  of  the  temperature  moment  of  the 
probable  state,  say,  we  might  (1)  grow  tired  and  (2) 
pause  on  our  way  to  the  textbook.  Until  we  are  clear 
what  are  the  dependent  variables  in  the  problem,  as¬ 
suming  equations  of  motion  is  premature. 

Next  steps  are,  in  part,  familiar.  Linear  terms  in 
equations  of  dy/dt  commute  with  expectation  and 
averaging  operators,  so  linear  terms  in  dV/dt  are  “as 
usual”.  Nonlinearities  in  dy/dt  can  be  expressed  in 
parts  as  corresponding  nonlinearies  among  compo¬ 
nents  of  Y,  which  again  may  look  familiar  following 
“usual”  Reynolds  averaging.  And  there  is  “more,” 
the  “stuff’  that  connects  the  Y  to  all  the  P{y)  which 
we  do  not  know.  dY/dt  =  f(Y)-(-X,  where  “f  ’  express 
the  “familiar”  equations  from  textbooks  and  “X”  are 
the  unknowns  (of  course). 

Two  routes  to  “X” 

The  question  of  “X”  should  be  seen  in  context  of 
nonequilibrium  statistical  mechanic,  a  gloriously  un¬ 
solved  problem.  There  are  two  avenues.  I  have 
tended  to  follow  Lars  Onsager,  seeing  in  “X”  the 
generalized  thermodynamic  forcing  X=k*Vy5  where 
5  =  -J  In  (P)  dP  is  entropy,  denotes  the  gradient  of 
entropy  with  respect  to  the  Y,  and  K  supplies  the  cou¬ 
pling  with  which  VyS  force  dY/dt.  As  we  don’t  know 
P,  hence  we  don’t  know  S,  or  VyS  and  we  don’t 
know  K,  all  this  looks  like  useless  window  dressing. 
Maybe  not.  If  we  can  determine  some  Y  =  Y*  for 
which  YyS  is  small  (in  the  sense  much  smaller  than 
VyS  at  the  actual  Y)  we  could  try  to  expand  K*VvS  = 
k*V\v5*(Y=Y*).  Call  K»V\yS  =  C  so  it  doesn’t 
look  so  scary  and  we  have  only  two  problems:  what 
is  Y*  and  what  is  C? 

Y*  is  usually  obtained  by  thinking  about  dY/dt  = 
f(Y)  under  idealized  circumstances,  where  we  sup¬ 
pose  many  excited  degrees  of  freedom  while  omitting 
all  external  forcing  and  internal  dissipation  (here  re¬ 
garded  as  “external”  to  dynamics  of  Y).  Dynamics 
sometimes  are  further  simplified,  e.g.  to  quasi- 
geostrophy,  to  make  calculation  of  Y*  tractable. 
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Subject  to  integrals  of  the  motion  of  idealized  dYldt= 
f(Y),  Y*  is  the  Y  that  maximizes  S,  i.e.,  Vy-S'  =  0. 

Aside:  This  point  has  confused  onlookers  more 
than  any  other.  The  “theory”  appears  to  be  to  maxi¬ 
mize  entropy.  But  such  a  result  would  only  apply  to 
a  mathematical  idealization  (an  isolated,  unforced, 
nondissipative  system)  arguably  far  from  Earth’s 
oceans.  What  needs  be  emphasized  is  that  the  whole 
idea  is  to  use  Y*  as  a  means  to  access  non-zero  VyS 
in  order  to  complete  the  actual  equations  of  motion  of 
actual  Y.  This  is  not  a  “maximum  entropy”  theory  of 
anything. 

Couplings  C  remain  to  be  estimated  and,  in  my 
work  to  date,  are  largely  fudged.  (Y*  isn’t  so  great 
either.)  At  this  time  the  point  is  not  to  find  “the  an¬ 
swer”  (don’t  I  only  wish!),  but  rather  to  identify  the 
parts  of  the  answer  which  may  yield  to  successive 
efforts.  At  C  I  encounter  the  same  kinds  of  semi- 
empirical,  largely  fudged,  by-guess-and-by-golly  es¬ 
timations  which  are  characteristic  of  our  ability  to 
represent  oceanic  turbulence. 

Here  let  me  mention  a  second  approach  to  “X,”  re¬ 
cently  advanced  by  Joel  Sommeria  and  colleagues. 
The  idea  is  to  find  an  expression  for  overall  produc¬ 
tion  of  entropy,  dS/dt,  which  can  be  maximized  with 
respect  to  Y.  X  is  then  the  force  on  dYIdt  which 
maximizes  dS/dt.  Although  both  the  derivation  of 
dS/dt  and  the  assignment  of  constraints  for  maximiz¬ 
ing  dS/dt  have  raised  new  issues  and  new  uncertain¬ 
ties,  the  maximum  entropy  production  approach  of¬ 
fers  an  important  complement  to  the  entropy  gradient 
forcing  which  I  have  pursued.  Happily,  Igor  Polya¬ 
kov  has  compared  the  two  approaches  in  a  case  of 
Arctic  ocean  modeling  and  finds  pleasingly  similar 
results. 

So  what? 

A  reader  can  well  ask:  if  we  are  only  stirring  cream 
into  coffee,  isn’t  this  entropy  talk  a  lot  of  bother? 
Indeed  if  only  ocean  eddies  were  as  simple  as  the 
coffee,  we  should  hardly  bother.  For  much  of  what 
we  do  about  ocean  mixing  parameterizations,  effort 
to  recast  the  discussion  in  terms  of  entropy  calculus 
would  (most  likely)  only  append  a  superstructure 
over  what — ^practically — we  do  anyway.  On  the 
other  hand,  during  this  ‘Aha  there  were  two  very  dif¬ 
ferent  areas  of  research  where  results  were  not  “sim¬ 
ple”  in  the  sense  of  cofi’ee  turning  brown. 


First  recall  David’s  results  in  two-layer  flow  over 
topography.  Numerical  experiments  do  not  lead  to 
flattening  isopycnals  (reducing  APE)  and  do  not  lead 
to  uniform  PV.  What  are  eddies  doing?  The  sugges¬ 
tion,  which  would  need  to  be  quantified  suing  actual 
code  for  actual  geometry  of  David’s  experiments,  is 
that  eddies  move  the  two-layer  flow  to  nearly  the 
highest  entropy  it  can  attain.  One  approach  may  be, 
if  the  code  David  used  can  be  run  in  dissipationless, 
conservative  mode,  then  the  model  itself  can  be  let 
run  to  reveal  Y*.  There  is  no  reason  Y*  should  re¬ 
flect  either  minimum  APE  or  uniform  PV.  When 
actual  dissipation  and  forcing  (if  present)  cause  actual 
Y  to  depart  from  Y*,  eddy  fluxes  should  arise  in  the 
model  (testably)  approximately  proportionally  to  Y*- 
Y. 

Peter  reminded  us  of  an  older  illustration  from  sta¬ 
tistical  mechanics,  recalling  a  hypothetical  Arctic 
circulation  (from  myself  from  ‘Aha  Huliko’a,  1993!) 
in  which  rectified  (“neptune”)  flows  were  induced  by 
eddies.  While  those  early  results  were  barotropic, 
extensions  to  baroclinic  flow  apply  to  David’s  case. 

A  different  result  that  stirred  controversy  during 
‘Aha  was  George  Camevale’s  simulation  of  internal 
wave  breaking.  When  George  evaluated  vertical 
buoyancy  flux,  w  V,  where  buoyancy  b  =  (po-p)/po  is 
the  fractional  deficit  of  density  about  reference  po  and 
w  is  vertical  velocity,  spectral  contributions  were 
positive  (upwards)  over  nearly  all  k.  In  particular 
w'b'(k)>0  over  all  k  that  were  “turbulent”  by  any 
measure  of  “turbulence”.  Because  the  experiments 
were  stably  stratified,  mean  db/dz  >  0  and  the  turbu¬ 
lence  from  internal  wave  breaking  forced  b  up  the  b- 
gradient  (on  average),  “anti”-mixing.  This  is  not  stir¬ 
ring  the  cream  into  the  coffee!  What  was  wrong? 
Sentiments  at  ‘Aha  ranged  from  (1)  the  experiments 
were  performed  improperly  (wrong  large  scale  forc¬ 
ing)  to  (2)  analyzing  outcome  in  “z”  is  wrong,  and 
density  coordinates  should  be  used. 

Or  maybe  George  had  things  right,  as  indeed  (I 
think)  explains  the  differential  diffusion  which  I  re¬ 
port  elsewhere  in  these  proceedings.  If  George  was 
right,  why  w'b'(k)>01  Again  I’ll  only  speculate 
without  direct  access  to  George’s  output,  but  I  be¬ 
lieve  that  wave  “breaking”  efficiently  scatters  poten¬ 
tial  energy  PE  (as  b'^)  to  higher  wavenumbers.  Over 
most  k,  the  result  was  PE  >  KE  at  each  k.  In  these 
experiments  without  Coriolis,  Y*  for  internal  gravity 
modes  equipartitions  PE  and  KE.  Thus  w  'b  '(k)  >  0, 
converting  PE  =>  KE,  is  driven  by  Y-Y*. 
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Does  this  help?  A  shortened  list  of  guiding  princi¬ 
ples  can  read  only  “dS/dt  >  0”  but  the  ongoing  practi¬ 
cal  challenge  is  to  put  this  idea  to  work.  Progress  is 
slow  because  the  methods  are  unfamiliar.  But  I  think 
tangible  practical  progress  is  being  made. 


Representing  the  effects  of  mesoscale  eddies  in 
coarse-resolution  ocean  models 

T.J.  McDougall 

CSIRO  Marine  Research,  Hobart,  Tasmania  -  Australia 


Abstract.  Mesoscale  eddies  in  the  ocean  mix  fluid  parcels  in  a  way  that  is 
highly  constrained  by  the  stratified  nature  of  the  fluid — ^so  much  so  that  much  of 
our  intuition  about  ocean  mixing  comes  fi-om  thinking  in  density  coordinates. 
Temporal-residual-mean  (TRM)  theory  provides  the  link  between  the  different 
views  that  are  apparent  from  averaging  turbulent  flow  in  height  coordinates  and 
in  density  coordinates.  The  TRM  theory  reduces  the  parameterization  problem 
from  three  dimensions  to  two  dimensions  and  it  shows  how  the  divergent  part  of 
the  relevant  eddy  density  flux  is  skew-symmetric  in  height  coordinates  and  that 
the  total  advection  velocity  can  be  adiabatic.  The  Gent-McWilliams  scheme  is 
best  interpreted  as  a  scheme  for  implementing  the  temporal  residual  mean.  The 
TRM  theory  has  very  specific  things  to  say  about  how  the  tracers  in  coarse- 
resolution  models  should  be  interpreted  and  how  the  skew  diffusion  streamfunc- 
tion  should  approach  zero  at  boundaries.  Here  it  is  emphasized  that  the  extra  ad¬ 
vection  in  the  TRM  theory  is  not  the  bolus  transport  because  the  extra  TRM  ad¬ 
vection  is  non-divergent  and  diapycnal  in  character  while  the  bolus  velocity  is 
divergent  and  adiabatic.  When  using  eddy-permitting  numerical  model  results  to 
deduce  a  parameterization  for  mesoscale  eddies,  it  is  very  important  to  use  the 
full  TRM  theory,  and  in  so  doing,  the  parameterization  task  is  simplified  some¬ 
what. 


Introduction 

Gent  and  McWilliams  (1990)  realized  that  when  averag¬ 
ing  in  density  coordinates,  the  thickness-weighted  average 
velocity  involves  the  bolus  velocity,  defined  as  the  flow 
along  density  surfaces  caused  by  the  correlation  between 
the  horizontal  velocity  and  the  thickness  between  adjacent 
density  surfaces.  This  motivated  them  to  recommend  addi¬ 
tional  terms  in  the  tracer  conservation  equations.  For  sev¬ 
eral  years  these  extra  terms  were  interpreted  as  being  unde¬ 
sirably  diapycnal  in  character;  for  example  it  was  empha¬ 
sized  that  while  the  flow  at  any  point  in  space  was  forced  to 
be  diapycnal  by  the  mixing  scheme,  the  area-averaged  flow 
was  not  diapycnal. 

The  paper  of  Gent  et  al.  (1995)  pointed  out  the  important 
property  that  the  total  flow  (the  sum  of  the  resolved-scale 
and  the  eddy-indueed  velocity)  did  not  have  a  diapycnal 
component.  Gent  et  al.  (1995)  also  cast  the  scheme  in 
terms  of  a  two-dimensional  streamfunction  and  they 
pointed  out  the  sign-definite  sink  of  domain-averaged 
gravitational  potential  energy  that  the  scheme  provides. 
While  the  motivation  of  both  Gent  and  McWilliams  (1990) 
and  Gent  et  al.  (1995)  came  from  the  bolus  velocity  of  den¬ 


sity  eoordinates,  it  is  argued  below  that  the  Gent  et  al. 
(1995)  mixing  scheme  is  not  a  parameterization  for  the  bo¬ 
lus  velocity  but  rather  is  a  parameterization  for  the  extra 
streamfunction  of  the  temporal-residual-mean  (TRM)  the¬ 
ory.  The  TRM  theory  says  unambiguously  how  one  must 
interpret  the  variables  that  are  carried  by  coarse-resolution 
ocean  models  and  it  also  provides  the  physical  justification 
for  the  tapering  of  the  extra  streamfunction  to  zero  at  ocean 
boundaries.  We  begin  with  a  quick  revision  of  the  theory 
and  then  discuss  some  common  misconceptions  about  the 
Gent  et  al.  (1995)  parameterization  scheme. 

The  temporal-residual-mean  approach 

The  averaging  operator  (the  overbar)  here  is  defined  to 
be  a  low-pass  temporal  average  and  primed  quantities  are 
the  deviations  from  this  low-passed  value.  For  simplicity 
the  equation  of  state  is  taken  to  be  linear  (McDougall  and 
McIntosh,  2001)  have  derived  the  TRM  theory  for  a  non¬ 
linear  equation  of  state).  The  usual  conservation  equations 
for  Eulerian-averaged  density  and  for  half  the  density  vari¬ 
ance  are 
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7,+v-(uf)=(2-v(uy)  (1) 

D,0  =^-Uy-Vf +0(y).  (2) 

Here  0  s  i/2  /'■  is  half  the  density  variance  measured  at  a 
fixed  point  in  space,  and  the  terminology  O(a’)  indicates 

terms  that  are  of  cubic  or  higher  order  in  perturbation  am¬ 
plitude.  Q  represents  the  effects  of  molecular  diffusion  and 
small-scale  mixing  processes. 

In  order  to  develop  residual-mean  conservation  equations 
that  apply  to  unsteady  flows  we  need  to  admit  the  possibil¬ 
ity  that  the  Eulerian-mean  density  may  not  be  the  most  ap¬ 
propriate  mean  density  to  appear  in  the  mean  density  con¬ 
servation  equation.  For  example,  the  Eulerian-mean  den¬ 
sity,  f{x,y,z,t),  describes  a  density  surface  whose  aver¬ 
age  height  is  not  that  of  the  original  Eulerian  averaging,  z- 
The  appropriate  mean  density  is  the  one  whose  surface  is, 
on  average,  at  the  height  of  the  averaging.  This  density  can 

be  expressed  in  terms  of  /  and  (j)  by 

f  =Y-{^/f.)^+0[a^).  (3) 

The  distinction  here  is  between  averaging  density  at  a 
given  height,  z,  and  averaging  the  height  of  a  given  density 
surface  7  =  7.  McDougall  and  McIntosh  (2001)  used  the 

density  variance  equation  (2)  to  rewrite  the  mean  density 
conservation  equation  (1)  as 

f,-(-V-(Cf)=(2’-V-(U7)-VM-i-0(y) 

(4) 

=  2“-  V-(AVf)-V  N+0(y) 

where 


U+sVx('Fxk)='F^-k(V/^  •‘F) 
Q 


QY  ,  6,- 

fill 

ft  fz 

UJJ 

y  /  j.  ^ 


Yz  Yz 


1_ 

Yz 


+  0 


(5) 

(6) 

(7) 


and  M  and  N  are  non-divergent  fluxes,  expressions  for 
which  are  given  in  McDougall  and  McIntosh  (2001).  The 
anti-symmetric  tensor  A  is  defined  in  terms  of  the  two 
components  of  the  quasi-Stokes  streamfunction, 

4/^(4/.  vp.v),  as 


A  3 


0  0 
0  0 


'F' 

\p> 

0 


(8) 


Isopycnal  Interpretation  The  upper  line  of  equation  (4) 
shows  that  the  modified  density,  y ,  is  advected  with  the 

TRM  velocity,  U*=U-i-U*,  where  both  the  Eulerian- 

mean  velocity,  U,  and  the  quasi-Stokes  velocity,  U'*’,  are 

non-divergent.  This  equation  shows  that  the  TRM  velocity 
only  has  a  component  through  the  modified  density  surfaces 

because  of  the  modified  diabatic  source  term,  Q^.  This  is 
in  contrast  to  the  Eulerian-mean  velocity,  because  (see  (1)) 
the  Eulerian-mean  velocity,  U,  flows  through  the  Eule¬ 
rian-mean  density  surfaces  due  not  only  to  the  diabatic 
source  term,  Q ,  but  also  due  to  the  mesoscale  eddy  flux  of 
density. 

A  Taylor  series  expansion  shows  that  the  horizontal 
component  of  the  TRM  velocity,  V',  is  the  same  as  the 
thickness-weighted  horizontal  velocity  that  is  found  by  av¬ 
eraging  in  density  coordinates,  V  .  That  is 

V  =  V-kV^  =V  +  'F,+0(a'):=V"+0(a')  (9) 

where  V  is  the  horizontal  velocity  averaged  on  a  density 
surface  and  the  bolus  velocity, 

correlation  between  the  horizontal  velocity  evaluated  on  the 
density  surface  and  the  thickness  between  density  surfaces. 
Because  of  (9)  it  can  be  shown  (McDougall  and  McIntosh, 
2001)  that  the  TRM  conservation  statement  for  density  in 
the  upper  line  of  (4)  is  the  same  as  is  found  by  thickness- 
weighted  averaging  in  density  coordinates.  The  key  to 
making  this  connection  is  to  simply  ignore  the  non- 
divergent  flux,  M,  in  (4).  The  same  Taylor  series  approach 
shows  that  the  modified  source  term,  is  the  thickness- 
weighted  source  term  of  density  coordinates. 

The  skew  diffusion  approach  in  the  second  line  of  (4)  is 
preferable  numerically  because  the  quasi-Stokes  stream- 
function  is  less  spatially  differentiated  than  in  the  advective 
approach  of  the  upper  line  of  (4)  (GriffieSy  1998).  The  skew 
diffusion  of  density  is  given  by 

-AVf  =  -7^Y  +  k(^V;^7) 

and  if  it  takes  the  Gent  et  al.  (1995)  form  of  the  quasi- 
Stokes  streamfunction,  then  the  parameterization  of 
mesoscale  mixing  that  was  used  before  GM90  is  equivalent 
to  retaining  the  horizontal  component  of  this  skew  flux  but 
ignoring  the  vertical  component. 

The  Taylor  series  approach  also  furnishes  a  physical  ex¬ 
planation  of  the  quasi-Stokes  streamfunction,  V ;  at  any 
height,  z,  'F  is  the  contribution  of  temporal  perturbations  to 
the  horizontal  transport  of  water  that  is  denser  than  7  (z), 
the  density  of  the  surface  having  time-mean  height  z. 
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The  TRM  analysis  procedure  can  also  be  applied  to  the 
tracer  conservation  equations  (see  McDougall  and 
McIntosh,  2001)  and  one  finds  that  the  terms  required  to 
account  for  the  mixing  by  mesoscale  eddies  are  (i)  the  epi- 
neutral  mixing  of  passive  substance  along  the  local  neutral 
tangent  plane  {Griffies  et  al,  1998)  and  (ii)  the  skew  diffu¬ 
sion  of  tracer  with  the  same  skew  diffusion  tensor  as  for 
density,  namely  (8).  It  is  this  second  term  that  corresponds 
to  the  Gent  and  McWilliams  (1990)  scheme.  The  extra 
mixing  term  can  be  regarded  as  an  extra  advection  by  the 
quasi-Stokes  velocity,  or  as  the  skew  diffusion  term, 
~  V  •  (aVt)  .  An  important  new  finding  of  the  TRM  the¬ 
ory  is  that  the  tracer  conservation  equation  is  written  in 
terms  of  the  thickness-weighted  tracer  of  density  coordi¬ 
nates,  T .  This  finding  completes  the  correspondence  be¬ 
tween  each  of  the  terms  in  the  TRM  conservation  equations 
and  those  that  arise  in  density  coordinates. 

Cubic  order  terms  and  exact  TRM  equations.  All  of  the 
above  results  are  accurate  to  cubic  order  in  perturbation 
amplitude.  Recent  unpublished  work  has  demonstrated  that 
away  from  solid  boundaries  the  cubic  terms  that  appear  in 
the  above  expressions  are  quite  small  in  comparison  with 
the  quadratic  order  terms  that  are  to  be  parameterized.  It 
has  also  been  shown  by  McDougall  and  McIntosh  (2001) 
that  the  conservation  equations  are  exact  if  the  quasi-Stokes 
streamfunction  is  in  fact  as  described  in  the  physical  expla¬ 
nation  above,  rather  than  as  defined  in  (7).  The  exact  form 
of  the  quasi-Stokes  streamfunction  which  leads  to  exact 
tracer  conservation  equations  is 

W=]\dz’  (10) 

which  is  the  temporal  average  of  the  vertical  integral  of  the 
horizontal  velocity  integrated  between  the  fixed  height,  z, 
and  the  instantaneous  height  of  the  density  surface,  z+z^. 

If  we  were  ever  clever  enough  to  parameterize  this  form  of 
the  quasi -Stokes  streamfunction,  the  job  of  parameterizing 
the  skew  diffusion  by  mesoscale  eddies  would  be  complete 
and  the  analogy  with  the  equations  found  by  averaging  in 
density  coordinates  would  be  exact.  The  parameterization 
task  for  coarse-resolution  ocean  models  is  then  to  param¬ 
eterize  (10). 

In  summary,  I  believe  the  important  practical  features  of 
the  TRM  theory  are  that 

(i)  it  demonstrates  exactly  how  one  must  interpret  the 
tracers  that  are  carried  by  eddyless  ocean  models, 

(ii)  it  shows  that  the  job  of  parameterizing  the  dynamical 
effects  of  mesoscale  eddies  is  reduced  to  the  task  of  param¬ 
eterizing  the  quasi-Stokes  streamfunction,  (10)  (or  (7)  in  the 
approximate  form),  and 


(iii)  the  physical  interpretation  of  the  quasi-Stokes 
streamfunction  demands  that  it  be  tapered  smoothly  to  zero 
at  ocean  boundaries  (as  opposed  to  other  suggestions  of 
having  delta  functions  at  the  boundaries  or  of  allowing  flow 
through  the  boundaries). 

The  rest  of  this  paper  will  attempt  to  correct  some  com¬ 
mon  misconceptions  about  the  Gent  et  al.  (1995)  scheme, 
and  then  to  make  some  remarks  about  the  task  of  deducing 
eddy  parameterizations  from  eddy-permitting  models.  But 
first,  some  comments  are  in  order  about  other  ways  of  aver¬ 
aging  the  conservation  equations. 

Comparison  with  other  averaging  approaches.  There 
are  other  ways  of  averaging  the  conservation  equations,  for 
example  the  ‘effective’  approach  following  Plumb  and 
Mahlman  (1987),  the  “stochastic”  approach  of  Dukowicz 
and  Smith  (1997),  and  the  Lagrangian-mean  approach  of 
Andrews  and  McIntyre  (1978).  To  date  these  approaches 
have  proved  less  useful  than  the  TRM  theory  for  the  follow¬ 
ing  reasons.  When  the  “effective”  approach  is  extended  to 
temporal  averaging  in  three  dimensions  (as  opposed  to 
zonal  averaging)  one  finds  that  the  density  which  is  ad- 
vected  by  the  “effective”  velocity  is  devoid  of  a  physical 
interpretation  and,  in  particular,  does  not  have  a  connection 
to  averaging  in  density  coordinates.  Similarly,  the  extra 
two-dimensional  streamfunction  of  the  “effective”  approach 
is  written  in  terms  of  the  first  order  approximations  to  La- 
grangian  displacements  of  fluid  parcels  and  is  similarly  not 
amenable  to  an  isopycnal  interpretation. 

In  the  stochastic  approach  both  the  eddy  diffusion  coeffi¬ 
cient  and  the  mean  velocity  are  written  in  terms  of  probabil¬ 
ity  density  functions  of  the  eddy  motions,  but  what  is  miss¬ 
ing  is  a  connection  to  the  Eulerian-mean  velocity  that  is 
naturally  carried  by  the  momentum  equations  in  a  coarse- 
resolution  model.  Consequently,  various  authors  have 
made  different  assumptions  as  to  what  the  mean  velocity  in 
the  “stochastic”  approach  might  be,  with  correspondingly 
different  interpretations  of  the  mean  variables  and  of  the 
things  that  need  to  be  parameterized.  For  example,  the  pa¬ 
pers  of  Dukowicz  and  Smith  (1997)  and  Dukowicz  and 
Greatbatch  (1999)  both  use  the  same  ‘stochastic’  theory  but 
have  different  conclusions  because  of  different  assumptions 
about  the  physical  meaning  of  the  ‘stochastic’  velocity. 

The  Lagrangian-mean  theory  of  Andrews  and  McIntyre 
(1978)  is  beautifully  general  but  the  Lagrangian-mean  ve¬ 
locity  is  divergent  at  leading  order  and  the  Lagrangian- 
mean  tracer  values  are  not  readily  observable.  The  beauty 
of  the  TRM  theory  is  that  it  provides  a  one-to-one  match 
between  every  variable  (tracer,  density  and  velocity)  that 
arises  in  the  TRM  approach  with  the  same  variable  that 
arises  when  averaging  in  density  coordinates.  Since  the  rate 
of  oceanic  diapycnal  mixing  is  so  small  in  relation  to  the 
lateral  fluxes  of  mesoscale  eddies,  it  is  imperative  to  cleanly 
separate  these  two  types  of  mixing  processes,  and  all  of  our 
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intuition  in  this  regard  comes  from  thinking  and  averaging 
in  density  coordinates.  It  is  this  direct  link  to  averaging  in 
density  coordinates  that  makes  the  TRM  approach  attractive 
for  z-coordinate  models. 

Neither  GM90  nor  the  quasi-Stokes  velocity  is 
the  bolus  velocity 

The  stated  aim  of  Gent  and  McWilliams  (1990)  and  of 
Gent  et  ai  (1995)  was  to  provide  a  parameterization  for  the 
bolus  velocity  and  it  was  argued  that  this  should  be  directed 
down  the  isopycnal  gradient  of  thickness  so  that  the  extra 
horizontal  velocity  would  be  However,  in 

order  to  make  the  streamfunction  a  locally  determined 
quantity,  and  particularly  to  ensure  a  sign-definite  sink  of 
gravitational  potential  energy,  the  diffusivity  was  moved 
inside  the  vertical  derivative  so  that  the  streamfunction  of 
Gent  et  ai  (1995)  became  kV This  form  also  al¬ 
lowed  the  boundary  conditions  at  the  top  and  bottom  of  the 
ocean  to  be  easily  satisfied  by  having  the  diffusivity  go  to 
zero. 

If  one  makes  the  most  elementary  assumption  for  the 
first  term  in  (7)  that  the  horizontal  density  flux  is  directed 
down  the  horizontal  gradient  of  /  with  diffusivity  K  (that 

is,  y'y'zz  fjY)  and  if  one  ignores  the  second  term  in 
(7),  the  quasi-Stokes  streamfunction  becomes 
^  /f^  ,  exactly  as  we  advocated  in  Gent  et  al. 

(1995)  and  as  used  by  many  subsequent  authors.  In  this 
way,  this  simple  Fickian  assumption  on  the  horizontal  flux 
of  density  has  enabled  us  to  interpret  the  Gent  et  ai  (1995) 
scheme  as  a  particular  parameterization  for  the  quasi-Stokes 
streamfunction  of  the  TRM  theory.  But  from  (9)  it  imme¬ 
diately  follows  that  the  GM90  and  Gent  et  al  (1995) 
schemes  are  not  parameterizations  for  the  bolus  velocity 
because  the  horizontal  component  of  the  bolus  velocity  is 
different  from  the  horizontal  quasi-Stokes  velocity,  V, , 

by  the  amount  V- V  which  is  the  difference  in  the  hori¬ 
zontal  velocity  averaged  at  constant  depth  and  at  constant 
density. 

The  movement  of  the  diffusivity  inside  the  vertical  de¬ 
rivative  in  the  Gent  et  al.  (1995)  expression  for  the  eddy- 
induced  horizontal  velocity,  ,  far  from  be¬ 

ing  a  weakness  of  Gent  et  al.  (1995)  because  of  the  weaker 
connection  to  the  bolus  transport,  is  actually  a  very  impor¬ 
tant  strength  of  the  GM90  scheme.  Consider  for  example 
the  situation  where  the  slopes  of  density  surfaces  are  inde¬ 
pendent  of  height  so  that  the  original  idea  of  down-gradient 
thickness  flux  would  mean  that  the  eddy-induced  horizontal 
velocity  was  zero  at  all  heights  except  for  delta  functions  of 
eddy-induced  horizontal  velocity  at  the  top  and  bottom. 


Finite-amplitude  baroclinic  instability  operating  on  such  an 
initial  condition  will  lead  to  the  simultaneous  relaxation  (in 
the  mean)  of  isopycnals  throughout  the  entire  water  col¬ 
umn,  not  merely  at  the  boundaries  of  the  domain.  The 
TRM  approach  can  achieve  this  property  by  allowing  the 
diffusivity,  k,  to  be  a  function  of  height.  In  this  way  the 
quasi-Stokes  velocity  of  the  TRM  theory  can  achieve  a 
more  realistic  parameterization  of  baroclinic  instability  than 
is  possible  with  a  down-gradient  scheme.  This  type  of 
depth-dependent  diffusivity  is  being  pursued  by  Killworth 
(2001).  The  same  criticism  applies  to  an  up-gradienl  poten¬ 
tial  vorticity  mixing  scheme  which  has  often  been  sug¬ 
gested  as  a  parameterization  for  •  When  there  is  no 
epineutral  gradient  of  potential  vorticity  in  the  whole  water 
column,  we  do  not  expect  baroclinic  instability  to  cause 
mean  motion  and  relaxation  of  isopycnals  only  in  delta 
function  boundary  layers  as  would  occur  with  a  potential 
vorticity  parameterization. 

In  addition  to  the  issue  of  having  the  diffusivity  inside 
the  vertical  derivative,  there  are  two  other  compelling  rea¬ 
sons  why  the  eddy-induced  velocity  of  Gent  et  ai  (1995) 
(which  we  are  interpreting  as  the  quasi-Stokes  velocity  of 
the  TRM  theory)  is  neither  the  bolus  velocity  nor  the  down- 
gradient  thickness  flux.  The  first  of  these  is  that  the  quasi- 
Stokes  velocity  is  three-dimensionally  non-divergent 
whereas  the  epineutral  flux  of  thickness  is  divergent  (see 
McDougall  and  McIntosh^  2(X)1),  The  second  reason  is  that 
the  bolus  velocity  and  the  epineutral  thickness  flux  have  no 
diapycnal  component,  whereas  the  quasi-Stokes  velocity 
has  a  large  diapycnal  component  (see  for  example,  Figure 
6b  of  Hirst  and  McDougall,  1998).  These  differences  arise 
through  the  use  of  the  streamfunction  (that  is,  the  continuity 
equation)  to  construct  the  vertical  component  of  both  the 
quasi-Stokes  velocity  and  of  the  eddy-induced  velocity  in 
the  Gent  et  al.  (1995)  scheme. 

The  task  of  parameterizing  the  effects  of  mesoscale  ed¬ 
dies  is  very  different  for  isopycnal  models  than  for  height- 
coordinate  models.  In  an  isopycnal  model  the  extra  hori¬ 
zontal  velocity  that  is  needed  is  indeed  the  bolus  velocity. 
The  sum  of  the  bolus  velocity  and  the  resolved-scale  hori¬ 
zontal  velocity  of  an  isopycnal  model,  V,  gives  the  thick¬ 
ness-weighted  velocity  of  isopycnal  coordinates,  V.  In 
sharp  contrast  to  the  large  diapycnal  component  of  the 
quasi-Stokes  velocity  of  height-coordinate  models,  the  bo¬ 
lus  advection  that  is  needed  in  isopycnal  models  has  zero 
diapycnal  component  (see  McDougall  and  McIntosh,  2(X)1). 

Some  issues  in  parameterizing 

It  is  argued  above  that  the  effects  of  mesoscale  eddies 
enter  the  tracer  conservation  equations  through  (i)  the 
symmetric  diffusion  tensor  which  represents  the  passive 
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epineutral  mixing  of  tracer,  and  (ii)  the  skew  diffusion  ten¬ 
sor  (8)  whose  elements  are  the  quasi-Stokes  streamfunction, 
( 1 0),  of  TRM  theory.  Some  authors  have  suggested  that 
should  be  parameterized  as  being  up  the  gradient  of 
potential  vorticity  (measured  along  the  local  neutral  direc¬ 
tion).  Apart  from  the  small  meridional  change  in  the  Corio¬ 
lis  parameter,  this  suggestion  is  the  same  as  the  down- 
gradient  thickness  idea.  McDougall  and  McIntosh  (2001) 
have  argued  that  irrespective  of  whether  or  not  these  sug¬ 
gestions  turn  out  to  be  a  good  parameterization  for  V, 

they  leave  open  the  question  of  the  vertical  component  of 
the  quasi-Stokes  velocity.  Equivalently,  this  down- 
potential  vorticity  parameterization  does  not  address  the 
issue  of  how  behaves  at  the  top  and  bottom  boundaries 
of  the  ocean.  McDougall  and  McIntosh  (2001)  have  shown 
that  this  uncertainty  means  that  the  contribution  of  this 
parameterization  to  the  meridional  flux  of  heat  is  quite  un¬ 
certain.  This  uncertainty  is  equivalent  to  the  practical  diffi¬ 
culty  that  arises  when  parameterizing  the  bolus  velocity  in 
coarse-resolution  layered  ocean  models  where,  in  practice, 
the  interface  height  is  smoothed  rather  than  actually  intro¬ 
ducing  a  parameterized  bolus  velocity  to  the  model.  The 
key  to  overcoming  this  considerable  uncertainty  in  the  me¬ 
ridional  heat  flux  is  to  parameterize  the  quasi-Stokes 
streamfunction,  'P, ,  itself,  rather  than  its  vertical  deriva¬ 
tive. 

There  are  other  rather  general  problems  with  the  down- 
gradient  potential  vorticity  parameterization.  Cummins 
(2000)  has  pointed  out  that  such  a  parameterization  can  lead 
to  a  fictitious  torque  which  spontaneously  generates  angular 
momentum  while  Adcock  and  Marshall  (2000)  have  shown 
that  energy  conservation  is  violated  as  the  scheme  attempts 
to  homogenize  potential  vorticity  (or  thickness)  along 
neutral  density  surfaces. 

While  our  skill  in  parameterizing  the  quasi-Stokes 
streamfunction  is  clearly  in  its  infancy,  some  eddy- 
permitting  models  are  showing  intriguing  results.  For  ex¬ 
ample,  Treguier  (1998)  has  analyzed  a  primitive  equation 
model  for  the  zonally  averaged  bolus  velocity  and  has 
found  a  small  diffusivity  for  use  in  the  quasi-Stokes  diffu- 
sivity.  Such  a  result  would  be  expected  to  have  some  bene¬ 
fits  so  long  as  the  models  remain  stable  with  these  smaller 
diffusivities.  The  two  benefits  that  come  to  mind  are  avoid¬ 
ing  the  slowing  of  the  horizontal  circulation  of  the  sub¬ 
tropical  gyres  that  occurs  with  the  larger  values  of  the 
quasi-Stokes  diffusivity,  and  reducing  the  intrusion  of  Ant¬ 
arctic  Bottom  Water  into  the  North  Atlantic  which  is  too 
strong  with  present  values  of  the  quasi-Stokes  diffusivity 
{Hirst  and  McDougall,  1998). 

Given  the  above  problems  with  both  the  down-gradient 
thickness  and  the  up-gradient  potential  vorticity  schemes,  it 
seems  that  the  way  forward  for  the  foreseeable  future  will 


be  to  stay  with  the  general  Gent  et  ai  (1995)  form  of  the 
quasi-Stokes  streamfunction,  =  but  to  al¬ 

low  the  diffusivity,  x;  to  be  a  function  of  space. 

It  must  be  pointed  out  (following  Tandon  and  Garrett, 
1996)  that  the  adiabatic  nature  of  the  TRM  velocity  implies 
that  the  eddy  kinetic  energy  of  mesoscale  eddies  cannot  be 
dissipated  in  the  ocean  interior  but  rather  must  be  dissipated 
near  the  upper  and/or  lower  boundaries.  This  seems  a 
rather  special  restriction  on  the  energy  budget. 

When  seeking  to  determine  a  parameterization  for  the 
quasi-Stokes  streamfunction  from  the  output  of  eddy- 
permitting  ocean  models,  it  is  important  to  use  the  correct 
TRM  conservation  equations.  For  example,  Gille  and 
Davis  (1999)  have  analyzed  an  eddy-permitting  primitive 
equation  model  of  a  zonal  channel  with  respect  to  the  con¬ 
servation  of  the  Eulerian-mean  density,  f,  rather  than  the 
modified  density  of  (4),  and  they  also  considered  only  the 
first  term  in  the  expression,  (7),  for  the  quasi-Stokes 
streamfunction.  Analyzing  model  data  in  this  way  leaves 
the  extra  forcing  term, 

(-uy  vf/f^)  ,  (11) 

in  the  density  conservation  equation  and  Gille  and  Davis 
(1999)  conclude  that  this  term  is  too  large  to  ignore  and 
needs  to  be  parameterized.  McDougall  and  McIntosh 
(2001)  have  shown  that  this  extra  source  term  does  not  arise 
when  the  full  residual-mean  transformation  is  performed. 
Rather,  all  that  needs  to  be  parameterized  is  the  quasi- 
Stokes  streamfunction. 

Similarly,  Roberts  and  Marshall  (2000)  have  examined 
the  divergent  part  of  the  horizontal  density  flux  and  found 
that  it  was  directed  as  much  perpendicular  to  the  mean  den¬ 
sity  gradient  as  down  the  mean  gradient.  Unpublished  work 
has  shown  (at  least  for  the  Southern  Ocean)  that  while  this 
is  true  of  the  first  term  in  the  expression,  (7),  when  the  sec¬ 
ond  term  is  also  included,  the  full  quasi-Stokes  streamfunc¬ 
tion  is  much  closer  to  being  directed  down  the  mean  density 
gradient.  This  supports  the  contention  that  it  is  very  impor¬ 
tant  in  efforts  aimed  at  finding  parameterizations  for 
mesoscale  turbulence  that  the  conservation  equations  are 
carefully  and  accurately  derived.  When  all  the  terms  are 
kept,  as  has  been  done  in  the  above  TRM  equation  set,  the 
parameterization  task  seems  to  be  more  promising  than 
when  leading  order  terms  are  ignored. 

Conclusions 

The  intimate  relationship  between  averaging  in  density 
coordinates  and  the  TRM  conservation  equations  is  what 
provides  the  satisfying  physical  interpretations  for  the  vari¬ 
ous  quantities  that  arise  in  the  TRM  approach.  For  exam¬ 
ple,  the  quasi-Stokes  streamfunction,  (10),  provides  a  com- 
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pelling  link  between  the  coordinate  systems  since  it  is  the 
contribution  of  temporal  perturbations  to  the  horizontal 
transport  of  water  that  is  denser  than  y{z),  the  density  of 
the  surface  having  time-mean  height  z.  I  have  argued  that 
the  parameterization  task  we  face  is  to  parameterize  the 
quasi-Stokes  streamfunction;  not  the  horizontal  bolus  veloc¬ 
ity,  not  the  down-gradient  flux  of  thickness  and  not  the  up- 
gradient  flux  of  potential  vorticity.  While  it  might  seem 
tedious  to  keep  track  of  all  the  leading  order  terms  (second 
order  in  perturbation  quantities)  in  the  conservation  equa¬ 
tions,  when  seeking  to  parameterize  mesoscale  eddies,  it  is 
becoming  more  obvious  that  we  cannot  afford  to  ignore 
these  leading  order  terms. 
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Abstract.  Recent  advances  in  computer  architecture  allow  for  numerical  in¬ 
tegration  of  state-of-the-art  ocean  models  at  basin  scale  with  a  grid  resolution 
of  1/10  or  higher.  At  that  resolution,  the  Gulf  Stream’s  separation  at  Cape 
Hatteras  is  well  simulated,  but  substantial  differences  from  observations  are 
still  observed  in  its  path,  strength,  and  variability.  Several  high  resolution 
(1/12°)  North  Atlantic  simulations  performed  with  the  Miami  Isopycnic  Co¬ 
ordinate  Ocean  Model  (MICOM)  are  discussed  and  the  results  suggest  that, 
even  with  such  a  fine  grid  spacing,  the  modeled  large  scale  circulation  is  still 
quite  sensitive  to  choices  in  forcing  and  viscosity  parameterization. 


1.  Introduction 

Until  recently,  most  ocean  general  circulation  models 
(OGCMs)  had  great  difficulties  in  reproducing  the  basic 
pattern  of  the  Gulf  Stream.  The  modeled  Gulf  Stream 
had  in  general  the  tendency  to  separate  far  north  of 
Cape  Hatteras  and  to  form  a  large  stationary  anticy- 
clonic  eddy  at  the  separation  latitude  [see  Dengg  et  al. 
(1996)  for  a  review].  Simulations  with  grid  resolution  of 
1/10°  or  higher  are  now  able  to  realistically  represent 
the  Gulf  Stream  separation  {Paiva  et  al.,  1999;  Smith 
et  al.,  2000;  Hurlburt  and  Hogan,  2000).  These  results 
support  the  view  that  a  good  representation  of  the  in¬ 
ertial  boundary  layer  is  an  important  factor  in  the  sep¬ 
aration  process  (Ozgdkmen  et  al.,  1997).  The  fine  mesh 
size  also  resolves  the  first  Rossby  radius  of  deforma¬ 
tion  everywhere  in  the  subtropical  gyre  (marginally  in 
the  subpolar  gyre),  therefore  providing  a  good  represen¬ 
tation  of  barochnic  instability  processes  {Paiva  et  al., 
1999;  Smith  et  al.,  2000). 

However,  despite  the  more  realistic  behavior,  the  rep¬ 
resentation  of  the  Gulf  Stream  separation  differs  from 
one  simulation  to  the  next,  sometimes  significantly. 
This  paper  discusses  some  of  the  factors  influencing  the 
modeled  circulation  in  the  Miami  Isopycnic  Coordinate 
Ocean  Model  (MICOM) .  It  will  be  shown  that  even  with 
such  a  fine  grid  spacing,  the  viscosity  parameterization 
remains  of  importance  for  the  modeled  large  scale  ocean 
circulation. 


2.  Mean  sea  surface  height  fields 

Figures  1  and  2  display  the  mean  sea  surface  height 
(SSH)  from  two  MICOM  simulations  with  an  horizon- 


Figure  1.  3-year-mean  model  SSH  field  for  the  1/12° 
COADS-forced  MICOM.  The  viscosity  operator  is  a 
combination  of  biharmonic  (A4  =  VdAx^,  with  Vb  =  1 
cm/s)  and  Laplacian  {A2  =  max  [.1  Ax^x  deformation 
tensor,  VdAx],  with  Vb  =  .5  cm/s). 


tal  resolution  of  A(f>  =  1/12°.  The  horizontal  grid  is 
defined  on  a  Mercator  projection  with  the  resolution 
given  by  A0  x  A^cos(b),  where  b  is  the  latitude.  The 
first  simulation  (Fig.  1),  configured  from  28°S  to  65°N, 
was  integrated  with  MICOM  for  20  years  using  monthly 
chmatological  COADS-based  forcing  (including  fresh¬ 
water  flux)  plus  a  weak  restoration  to  monthly  climato¬ 
logical  surface  salinity  {Paiva  et  al,  1999,  Garraffo  et 
al.,  2001a,b).  The  second  simulation  (Fig.  2),  config¬ 
ured  from  28°S  to  70°N,  including  the  Mediterannean 
Sea,  was  first  spun-up  for  6  years  with  MICOM  us- 


39 


40 


CHASSIGNET  AND  GARRAFFO 


Figure  2.  2-year-mean  model  SSH  field  for  the  1/12° 
ECMWF-forced  MICOM.  The  viscosity  operator  is  the 
same  as  for  the  COADS-forced  nm  (see  caption  of 
Fig.  1). 

ing  monthly  climatological  ECMWF  atmospheric  fields 
(including  freshwater  flux)  plus  a  weak  restoration  to 
monthly  climatological  surface  salinity,  and  is  presently 
further  integrated  using  6-hourly  ECMWF  forcing  from 
1979  to  2000. 

In  both  simulations,  the  simulated  Gulf  Stream  path 
agrees  well  with  observations  until  the  location  of  the 
New  England  Seamounts  chain.  Eastward  of  the  chain, 
the  ECMWF-forced  run  (Fig.  2)  exhibits  a  path  that 
agree  well  with  observations  everywhere.  In  the  COADS- 
forced  run  (Fig.  1),  the  path  east  of  the  New  England 
Seamounts  chain  is  displaced  northward  by  about  1°  to 
2°.  This  northward  shift  in  the  COADS-forced  run  is 
associated  with  a  larger  than  observed  seasonal  migra¬ 
tion  of  the  path  [observed  annual  signal  of  up  to  100 
km,  north  of  the  mean  from  August  to  November  and 
south  of  the  mean  from  March  to  June  (A.  Mariano, 
1999,  personal  communication)].  This  higher  than  ob¬ 
served  seasonal  shift  results  primarily  from  the  fact  that 
the  MICOM  bulk  Kraus-Turner  mixed  layer  is  on  the 
average  deeper  in  the  COADS-forced  run  than  in  the 
ECMWF-forced  run  (not  illustrated).  The  deepening  of 
the  mixed  layer  in  winter  induces  a  decrease  in  the  mag¬ 
nitude  of  the  upper  layer  velocities  because  MICOM’s 
mixed  layer  does  not  allow  vertical  shear  (Kraus  and 
Turner,  1967).  A  deeper  mixed  layer  in  the  COADS- 
forced  run  therefore  implies  a  Gulf  Stream  that  is  less 
inertial  than  in  the  ECMWTF-forced  run.  The  end  re¬ 
sult  is  that  in  the  latter  run,  the  modeled  Gulf  Stream 
path  agrees  well  with  observations  and  does  not  exhibit 
a  higher  than  observed  seasonal  shift  in  latitude  east¬ 
ward  of  the  New  England  seamoimts  chain. 


The  impact  of  the  seamounts  on  the  Gulf  Stream 
path  and  variability  was  further  investigated  in  a  3-year 
sensitivity  experiment  with  GOADS  forcing  in  which 
the  bottom  topography  was  modified  by  removing  the 
New  England  seamounts.  The  impact  of  removing  the 
seamounts  on  the  Gulf  Stream  path  was  found  to  be 
neghgible  (not  illustrated). 

3.  Importance  of  the  viscosity 
parameterization 

When  the  grid  spacing  reaches  a  certain  threshold, 
the  energy  cascade  from  the  smeJl  to  the  large  scales 
should  be  properly  represented  by  the  model  physics. 
Dissipation  should  then  be  prescribed  for  nmnerical 
reasons  only  in  order  to  remove  the  inevitable  accu¬ 
mulation  of  enstrophy  on  the  grid  scale.  This  is  the 
reason  why  higher  order  operators  such  as  the  bihar¬ 
monic  form  of  friction  have  traditionally  been  favored 
in  eddy-resolving  or  eddy-permitting  numerical  simula¬ 
tions  {Holland,  1978;  Bryan  and  Holland,  1989;  Smith 
et  ai,  2000).  Higher  order  operators  remove  numerical 
noise  on  the  grid  scale  and  leave  the  larger  scales  mostly 
imtouched  by  allowing  dynamics  at  the  resolved  scales 
of  motion  to  dominate  the  subgrid-scale  parameteriza¬ 
tion  {Griffies  and  Hallberg,  2000). 

In  addition  to  numerical  closure,  the  viscosity  oper¬ 
ator  can  also  be  a  parameterization  of  smaller  scales. 
One  of  the  most  difficult  tasks  in  defining  the  param¬ 
eterization  is  the  specification  of  the  Reynolds  stresses 
in  terms  of  only  the  resolved  scales’  velocities  [see  Ped- 
losky  (1979)  for  a  review]  and  the  common  practice  has 
been  to  assume  that  the  turbulent  motion  acts  on  the 
large  scale  flow  in  a  similar  manner  as  molecular  viscos¬ 
ity.  However,  the  resulting  Laplacian  form  of  dissipa¬ 
tion  removes  both  kinetic  energy  and  enstrophy  over  a 
broad  range  of  spatial  scales,  and  its  use  in  numerical 
models  in  general  implies  less  energetic  flow  fields  than 
in  cases  with  more  highly  scale-selective  dissipation  op)- 
erators.  In  order  to  assess  the  impact  of  the  dissipation 
operator  on  the  Gulf  Stream  system,  several  sensitivity 
experiments  were  performed  with  MICOM  using  Lapla¬ 
cian  and  biharmonic  operators  for  the  viscosity  in  the 
momentum  equations. 

The  mean  SSH  of  two  simulations  performed  with 
two  different  magnitudes  of  the  biharmonic  viscosity 
coefficient  are  displayed  in  Figs.  3  and  4,  respectively 
(COADS-forced  run).  When  a  relatively  small  value  of 
the  biharmonic  viscosity  coefficient  is  used  (see  caption 
of  Fig.  3  for  details),  the  western  boimdary  current  is 
seen  to  separate  from  the  coast  early  at  the  Charleston 
bump  before  Cape  Hatteras  (Fig.  3).  A  similar  result 
was  observed  with  the  1/10°  Los  Alamos  Parallel  Ocean 
Model  (POP)  during  the  spin-up  phase  in  which  both 
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Figure  3.  l-year-mean  model  SSH  field  with  a  bihar- 
monic  viscosity  operator;  A4  =  max  [.1  Ax^x  deforma¬ 
tion  tensor,  VdAx%  with  Vd  =  1  cm/s. 


Figure  4.  1-year-mean  model  SSH  field  with  a  bihar¬ 
monic  viscosity  operator;  A4  =  max  [Ax^  x  deformation 
tensor,  VdAx%  with  Vd  =  2  cm/s. 


the  viscosity  and  diffusion  had  to  be  increased  by  a 
factor  of  3  in  order  to  eliminate  this  feature  {Smith 
et  of.,  2000).  An  increase  in  the  magnitude  of  the  bi¬ 
harmonic  viscosity  operator  in  MICOM  did  indeed  also 
eliminate  the  early  detachment  seen  in  Fig.  3,  but  it  also 
led  to  the  establishment  of  a  permanent  eddy  north  of 
Cape  Hatteras  (Fig.  4).  This  eddy  results  from  a  se¬ 
ries  of  warm  core  (anticyclonic)  rings  that  propagate 
westward,  collide  with  the  western  boundary,  and  are 
only  weakly  dissipated  by  the  biharmonic  viscosity  op¬ 
erator.  This  behavior  is  reminiscent  of  other  simula¬ 
tions  performed  with  biharmonic  dissipation  {Smith  et 


al.,  2000).  The  fact  that  this  permanent  eddy  only  ap¬ 
pears  with  biharmonic  operators  seems  to  indicate  an 
incorrect  representation  of  the  eddy/mean  flow  and/or 
of  the  eddy/topography  intereictions,  possibly  because 
of  the  scale  selectiveness  of  the  higher  order  operator 
that  allows  features  that  are  marginally  resolved  by  the 
grid  spacing.  In  all  simulations,  the  grid  spacing  is  such 
that  both  the  inertial  and  the  viscous  boundary  layers 
are  resolved  (very  well  for  the  inertial  and  minimally 
for  the  viscous). 


Figure  5.  l-year-mean  model  SSH  field  with  a  Lapla^ 
cian  viscosity  operator;  A2  =  max  [.1  Ax^  x  deforma¬ 
tion  tensor,  VdAx],  with  Vb  =  1  cm/s. 

The  mean  SSH  of  the  simulation  performed  with 
the  Laplacian  viscosity  operator  is  displayed  in  Fig.  5 
(COADS-forced  run).  The  magnitude  of  the  Laplacian 
viscosity  coefficient  is  the  minimiun  value  needed  for  nu¬ 
merical  stability.  In  that  simulation  (Fig.  5),  the  Gulf 
Stream  separates  well  from  the  coast,  but  does  not  pen¬ 
etrate  further  than  the  New  England  Seamounts. 

Overall,  neither  the  Laplacian  nor  the  bihaxmonic 
viscosity  operator  alone  provide  satisfactory  results  re¬ 
garding  the  Gulf  Stream  system  behavior.  With  the 
biharmonic  operator,  eddies  are  found  to  retain  their 
structure  for  longer  periods  of  time  than  with  a  Lapla¬ 
cian  operator,  but  with  undesirable  effects  on  several 
features  of  the  large  scale  circulation.  With  the  Lapla¬ 
cian  operator,  the  western  boundary  current  and  its 
separation  are  well  represented,  but  with  a  weak  pene¬ 
tration  of  the  Gulf  Stream. 

With  a  Laplacian  (harmonic)  dissipation  operator, 
the  evolution  of  a  wave  c(t)e**'®  is  damped  exponentially 
with  a  spin-down  time  T2  =  Aj ^  (^sin  (^))~^ .  In 
the  case  of  a  biharmonic  operator,  the  spin-down  time  is 
T4  =  A4  ^  (^sin  (^^))  .  For  compcirison  purposes. 
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Figure  6.  Laplacian  and  biharmonic  decay  time  scale 
as  a  function  of  the  wavelength  k  for  values  of  the  dif¬ 
fusive  velocity  Vd  =  -5  and  1  cm/s,  respectively. 


constant  harmonic  and  biharmonic  viscosity  coefficients 
can  be  expressed  as  a  function  of  a  diffusive  velocity 
Vjo  and  the  grid  spacing  Ax  os  A2  =  VdAx  and  A4  = 
VdAx^,  respectively.  Examples  of  spin-down  times  for 
both  operators  are  given  in  Fig.  6  for  the  average  grid 
spacing  of  the  MICOM  simulations  (6  km).  For  the 
same  diffusive  velocity,  the  biharmonic  operator  more 
strongly  selects  the  small  scales  to  dissipate  and  leaves 
the  large  scales  relatively  untouched. 

The  Laplacian  experiment  of  Fig.  5,  when  contrasted 
to  the  biharmonic  experiments  of  Figs.  3  and  4,  sug¬ 
gests  that  some  damping  of  the  larger  scales  is  neces¬ 
sary  for  a  reasonable  western  boundary  current  behav¬ 
ior.  The  best  separation/penetration  results  were  ob¬ 
tained  in  the  COADS-forced  and  the  ECMWF-forced 
runs  shown  in  Figs.  1  and  2  in  which  the  viscosity  op¬ 
erator  was  prescribed  as  a  combination  of  the  bihar¬ 
monic  and  Laplacian  operators.  The  main  motivation 
for  combining  the  two  operators  (see  caption  of  Fig.  1 
for  details)  was  to  be  able  to  retain  the  scale  selective¬ 
ness  of  the  biharmonic  operator  and  to  provide  some 
damping  at  the  larger  scales  [performed  in  this  case  by 
the  Laplacian  operator  for  k  greater  than  80  km  (Fig. 
6)].  This  allowed  us  to  reduce  the  magnitude  of  the 
Laplacian  coefficent  A2  by  50%  and,  at  the  same  time, 
ensure  numerical  stability  with  an  effective  damping  of 
the  smaller  scales  via  the  biharmonic  operator  (Fig.  6). 
When  combined,  the  individual  diffusive  velocity  Vd 
specified  for  each  operator  is  smaller  than  the  mimi- 
mum  value  that  is  needed  for  numerical  stability  when 
only  one  of  the  operators  is  specified. 


4.  Summary  and  discussion 

These  results  appear  to  suggest  that,  in  a  realistic 
setting,  even  with  such  a  fine  grid  spacing,  the  mod¬ 
eled  large  scale  ocean  circulation  is  strongly  dependent 
upon  the  choices  made  for  the  viscosity  operators.  Fur¬ 
thermore,  it  appears  that  the  cascade  of  energy  from 
the  small  scales  to  the  larger  scales  may  not  take  place 
as  anticipated  and  that  some  large  scale  information 
is  needed  for  a  proper  representation  of  the  western 
boundary  current.  In  the  experiments  described  in  this 
paper,  the  latter  is  taking  place  via  the  Laplacian  vis¬ 
cosity  operator.  Hyperviscosity  (V^^  operator  with 
n  >  2)  is  often  used  in  numerical  simulations  of  tur¬ 
bulent  flows  to  extend  the  range  of  the  inviscid  iner¬ 
tial  cascade.  It  has,  however,  been  argued  that  it  may 
also  contribute  non-trivial  spurious  dynamics  {Jimenez^ 
1994).  While  it  can  be  firmly  stated  that  a  resolution  of 
1/10°  is  sufficient  for  the  Gulf  Stream  to  separate  from 
the  coast  at  Cape  Hatteras  {Paiva  et  o/.,  1999;  Hurlburt 
and  Hogan^  2000;  Smith  et  a/.,  2000),  it  is  not  yet  clear 
what  is  the  optimal  resolution  for  a  correct  Gulf  Stream 
penetration  and  variability.  A  four-fold  increase  in  reso¬ 
lution  from  1/16°  to  1/64°  with  the  Laplacian  operator 
in  the  hydrodynamic  (i.e.  no  thermal  forcing)  Navy 
Layered  Ocean  Model  (NLOM  -  Hurlburt  and  Hogan, 
2000)  brought  the  SSH  variability  to  observed  levels 
without  altering  the  pattern  of  the  large  scale  circu¬ 
lation.  While  numerical  simulations  at  the  above-noted 
resolutions  are  becoming  more  common,  they  still  de¬ 
mand  the  latest  in  computing  facilities.  A  four- fold  in¬ 
crease  in  resolution  for  the  thermodynamically  forced 
models  cannot  be  realistically  implemented  with  the 
present  computer  resources.  Thus,  further  evaluation 
of  the  impact  of  various  dissipation  operators  on  the 
large  scale  circulation  should  be  pursued. 
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Abstract.  Geophysical  turbulent  fluids  are  characterized  by  the  presence 
of  organized  energetic  structures  which  control  tracer  transport  and  stirring, 
while  enabUng  a  tracer  cascade  down  to  the  smallest  scales.  In  order  to 
understand  the  physical  mechanisms  involved  in  this  turbulent  tracer  cascade, 
we  focus  on  the  d3mamics  underlying  the  formation  of  tracer  gradients  which 
are  necessarily  associated  to  this  cascade.  We  show  that  the  dynamics  of 
tracer  gradients  in  physical  space  is  mainly  governed  by  their  orientation 
with  respect  to  the  compressional  eigenvector  of  the  strain  tensor.  This 
relative  angle  results  from  the  competition  between  strain  and  the  ’’effective 
rotation”  (due  to  both  vorticity  and  rotation  of  strain  axes).  The  implication 
is  that  tracer  gradients  (be  they  passive  or  active)  should  align  with  specific 
directions  of  the  flow  field,  which  depends  only  on  the  local  velocity  and 
acceleration  gradient  tensors  in  physical  space.  Most  of  the  tracer  stirring  is 
thus  occurring  at  specific  locations  that  can  be  identified  analytically.  These 
results  have  been  confirmed  by  direct  nvunerical  simulations  and  enable  a 
better  characterization  of  the  cascade  in  physical  space. 


1.  Introduction 

The  widely  observed  tracer  cascade  toward  small  spa¬ 
tial  scales  in  geophysical  flows  is  known  to  result  from 
stirring  by  mesoscale  eddies.  It  corresponds  to  the  for¬ 
mation  of  strong  horizontal  gradients,  such  as  those 
indicated  by  the  chlorophyl  concentration  at  the  sea 
surface  as  observed  by  satellite  (Figure  1).  Another 
observed  characteristic  is  that  horizontal  gradients  of 
different  tracer  fields  are  often  found  at  the  same  lo¬ 
cations  in  physical  space.  This  is  consistent  with  the 
interpretation  that  such  locations  result  from  the  com¬ 
mon  topology  of  the  underlying  flow  that  advects  the 
different  tracer  fields.  This  is  observed  for  instance  in 
Figure  2,  which  presents  airborne  measurements  of  dif¬ 
ferent  mixing  ratios  in  the  southern  stratosphere,  show¬ 
ing  sharp  horizontal  gradients  coinciding  at  several  loca¬ 
tions  {Tuck  et  a/.,  1992).  On  the  other  hand,  the  differ¬ 
ent  strengths  of  the  relative  mixing  ratios  |  Vc|/c  (where 
c  is  the  mixing  ratio)  that  are  observed  in  Figure  2,  are 
the  result  of  the  differences  in  the  past  history  of  the 
tracer  fields,  which  may  be  caused  by  differences  in  their 
sources,  forcings  and  sinks. 


Figure  1.  Chlorophyll  concentration  as  observed  by 
SEA-WIFF 
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Figure  2.  Different  mixing  ratios  measured  in  the 
southern  stratosphere  (from  Tuck  et  al,  (1992). 


0 


Figure  3,  Vorticity  field  of  an  isolated  vortex  submit¬ 
ted  to  a  externally  prescribed  strain  field  {Mariotti  et 
al,  1994). 


However,  the  existence  of  intense  gradients  in  tracer 
distribution  does  not  always  imply  the  occurrence  of 
strong  mixing  of  the  tracer  field.  Figure  3  presents  the 
vorticity  distribution  of  an  isolated  vortex  which  is  in¬ 
fluenced  by  an  externally  prescribed  strain  field  {Mari¬ 
otti  et  al,  1994).  This  setup  leads  to  a  constant  erosion 
of  the  vortex  structure,  which  is  taking  place  through 
the  ejection  of  filaments  which  will  eventually  be  de¬ 
stroyed  by  small-scale  mixing.  However,  as  seen  in  the 
insert  of  Figure  3,  the  distribution  of  vorticity  observed 
along  a  horizontal  mid-section  across  the  vortex  shows 
a  significant  reinforcement  with  time  of  the  gradients 
at  the  vortex  boundary.  Such  a  phenomenon  is  the  op¬ 
posite  of  mixing  and  corresponds  to  the  formation  of  a 
barrier  to  transport  that  inhibits  exchanges  across  the 
vortex  boundary. 

Our  main  purpose  has  been  to  attempt  to  character¬ 
ize  dynamically  this  tracer  cascade  in  physical  space. 
The  specific  objectives  are  to  study  the  equations  gov¬ 
erning  the  dynamics  of  tracer  gradients  in  order  to  lo¬ 
calize  the  barriers  to  mixing  as  well  as  the  regions  of 
filament  production.  The  chosen  approach  is  to  use 
information  from  both  the  velocity  u  and  accel¬ 
eration  fields  ^  in  order  to  go  beyond  the  simple 
kinematic  approach  of  the  chaotic  advection  literature, 
which  relies  on  the  sole  knowledge  of  the  velocity  field. 

In  what  follows,  we  have  moreover  made  the  assump¬ 
tion  that  as  far  as  oceanic  mesoscale  eddies  are  con¬ 
cerned,  the  classical  framework  of  the  quasigeostrophic 
turbulence  is  a  valid  first  step. 

The  paper  is  organized  as  follows.  We  first  recall  the 
basic  ingredients  that  influence  the  evolution  of  tracer 
gradient  by  considering  simple  flows  (section  2).  The 
equations  governing  the  dynamics  of  tracer  gradients 
are  then  studied  in  section  3,  leading  to  the  prediction 
of  preferred  alignment  of  tracer  gradients  with  specific 
orientations  for  different  regions  of  the  flow,  which  are 
found  to  depend  on  the  tensors  of  velocity  gradient  and 
acceleration  gradients  Vu  and  V^.  Section  4  presents 
results  from  numerical  simulations  of  two-dimensional 
turbulence  in  free-decay,  providing  evidence  of  statis¬ 
tical  validation  of  alignment  properties  of  the  tracer 
cascade  in  physical  space.  Section  5  summarizes  our 
results  and  mentions  possible  applications  of  the  tracer 
gradient  dynamics. 

2.  Simple  flows 

For  a  pure  strain  field,  the  streamfunction  field  'ip  — 
a  xy  corresponds  to  the  blue  isolines  of  Figure  4,  where 
a  denotes  the  strain-rate  magnitude.  An  initial  tracer 
blob  (red  continuous  isoline)  will  be  stretched  with  time 
into  an  elongated  pattern  (the  red  dotted  isoline)  and 
the  tracer  gradient  Vg  (black  vector)  will  tend  to  align 
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Figure  4.  Pure  strain  field 


with  time  with  the  compressional  principal  axis  of  the 
strain  tensor  (denoted  by  S"“  in  Figure  4).  This  align¬ 
ment  is  associated  with  an  exponential  growth  of  the 
tracer  gradient  norm. 

In  the  case  of  a  pure  vorticity  field  the  streamfunc- 
tion  field  is  'tp  =  f  +  y^),  where  tv  denotes  the  vor¬ 
ticity.  The  initial  tracer  blob  (red  continuous  isoline) 
simply  rotates  with  time  (the  red  dotted  isoline)  and  so 
does  the  tracer  gradient  vector  (Figure  5).  There  is  no 
growth  of  the  tracer  gradient  norm.  The  above  simple 
limits  can  be  obtained  from  the  results  of  Okubo  (1970) 
and  Weiss  (1991).  For  a  passive  tracer  q  that  obeys  the 
conservation  equation 


^  =  0 
Dt 


its  gradient  will  obey 

DVq 

Dt 


=  -[Vtl]*  Vg. 


(1) 


[Vu]*  denotes  the  transpose  of  the  velocity  gradient  ten¬ 
sor  for  which  eigenvalues  are  dbA^/^,  where  A  = 
depends  on  the  competition  between  strain  and  vortic¬ 
ity.  Both  authors  made  the  assumption  that  [VtZ]*  is 
slowly  varying  along  a  Lagrangian  trajectory  ( 0) 
so  that  the  tracer  gradient  equation  can  be  integrated, 
yielding 

Vg  =  Vqo  exp(±A^/^  t). 

Thus  in  regions  where  the  strain  rate  dominates  (A  >  0), 
there  is  an  exponential  growth  of  the  gradient  norm, 


Figure  5.  Pure  vorticity  field 


Figure  6.  Finite  size  axisymmmetric  vortex 


while  in  regions  where  vorticity  dominates  (A  <  0),  the 
solution  corresponds  to  a  simple  rotation  of  the  gradient 
vector. 

It  is  easy  to  find  simple  counterexamples  where  the 
Okubo- Weiss  results  do  not  hold  {Pierrehumbert  and 
Yang^  1993).  Consider  the  case  of  an  axisymmetric 
vortex  of  finite  size,  such  that  outside  the  vortex  core, 
the  vorticity  is  a;  =  0,  while  the  strain  rate  is  cr  ^  0 
and  streamlines  are  still  circular.  In  such  a  region, 
\  =  (T^  —  ur^  =  >0  and  the  Okubo- Weiss  crite¬ 

rion  predicts  an  exponential  growth.  However,  such  a 
case  can  be  integrated  analytically  {Lapeyre  et  a/.,  1999) 
and  the  solution  is  found  to  correspond  to  a  continuous 
rotation  of  the  tracer  gradient  with  a  linear  growth  of 
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its  norm  with  time.  The  Okubo- Weiss  criterion  fails 
because  the  rotation  of  the  principal  axes  of  the  strain- 
rate  tensor  has  not  been  taken  into  account,  and  this 
implies  ^  0.  Along  the  circular  Lagrangian  tra¬ 
jectories  (Figure  6),  the  principal  axes  constantly  ro¬ 
tate  so  that  the  tracer  gradient  vector  cannot  align 
with  the  compressional  strain  axis,  and  the  initial  tracer 
blob  (continuous  red  line)  is  mostly  linearly  distorted 
with  time.  Other  simple  counterexamples  can  be  con¬ 
structed,  involving  a  rotation  of  the  strain-rate  principal 
axes,  where  the  Okubo- Weiss  criterion  fails  and  can  be 
found  for  instance  in  Young  (1999). 


Figure  7.  definition  of  angles  6  and  C 


3.  Dynamics  of  tracer  gradient 

The  equation  that  governs  the  dynamics  of  tracer 
gradient  (1)  can  be  explicit  as 


Dt  2  \  (Ts  —  UJ  —CTn  ) 
where  the  following  quantities  have  been  used 


(2) 


—  d^u  dyU 

=  dxV  -1-  dyu 

uj  =  dxV  —  dyU. 

It  is  important  to  note  that  equation  (2)  is  a  vector 
equation  and  corresponds  to  two  degrees  of  freedom 
that  can  be  chosen  as  the  tracer  gradient  norm  |Vg| 
and  its  orientation  0, 


An  angle  (j>  that  characterizes  the  orientation  of  the 
strain  axes  with  respect  to  the  coordinate  axis  (Figure 
7)  can  be  introduced 


(  ^  =  cr  r  ^ 

\  ^  \  cos  2^  y 

The  two  scalar  equations  for  the  norm  |  Vg|  and  the  gra¬ 
dient  orientation  6  are  derived  in  Lapeyre  et  al  (1999). 

=  -<Tsin(2(0  +  4>)) 

2^  =  a;  —  (T  cos(2(0  -f  ^)) 

Both  scalar  equations  depend  only  on  the  relative 
angle 

C  =  2(0  +  0) 

(Figure  7)  between  and  the  compressional  axis  S“. 
We  have  seen  in  previous  simple  examples  that  it  is 
this  relative  angle  that  determines  the  growth  rate  with 


time  of  the  gradient  norm.  Introducing  a  nondimen- 
sional  Lagrangian  time  which  is  related  to  the  strain- 
rate  magnitude 


r  =  f  cr(t')dt', 

one  can  rewrite  both  scalar  equations  as 


D  log  IVqfp 
Dt 


=  —  sin  ^ 


^  =r- 
Dr  ^ 


cosC* 


(3) 


The  orientation  equation  (3b)  involves  a  nondimen- 
sional  parameter  r  which  is  defined  as 


r  —  ^  2D<j>IDt  ^  effective  rotation 
(T  strain  rate 

The  dimensionless  parameter  r  is  the  ratio  between  “ef¬ 
fective  rotation”^  in  the  strain  basis  (i.e.  the  rotation 
effects  due  to  both  the  vorticity  and  the  rotation  of  the 
principal  axes  of  the  strain-rate  tensor)  and  the  magni¬ 
tude  of  the  strain  rate  (which  tends  to  align  the  gradient 
with  a  strain  eigenvector). 

An  important  remark  is  that  r  remains  invariant  in 
a  change  of  coordinates  involving  solid  body  rotation, 
while  the  Okubo- Weiss  eigenvalues  A  do  not  remain 
invariant  in  such  a  change  of  coordinates.  The  new 
physics  that  have  been  taken  into  account  correspond 
to  the  quantity  ^  that  takes  into  account  the  fact  that 
the  principal  axes  of  strain  can  vary  along  a  Lagrangian 
trajectory,  also  implying  that  ^  0. 

Note  that  the  dynamics  of  the  orientation  equation 
(3b)  is  completely  independent  of  the  actual  value  of  the 
gradient  norm  |V^|,  and  the  solution  to  (3b)  will  depend 
on  the  actual  value  of  r,  whether  r  >  1  or  r  <  1. 


Strain- dominated  regions 

By  definition,  this  corresponds  to  |r(  <  1.  Making 
the  weaker  assumption  that  both  r  and  the  strain  mag- 


^We  follow  the  terminology  of  Dresselhaus  and  Tabor  (1991). 


CHARACTERIZATION  OF  TRACER  CASCADE  IN  PHYSICAL  SPACE 


49 


nitude  a  are  slowly  varying  along  a  Lagrangian  trajec** 
tory,  the  equation  for  the  orientation  (  has  two  fixed 
points  =  iarccosr,  an  unstable  one  C+j  and  a  sta¬ 
ble  one  One  expects  a  rapid  alignment  of  the  tracer 
gradient  with  the  stable  orientation  C-j  leading  to  an 
exponential  growth  rate  of  cr\/l  —  for  the  tracer  gra¬ 
dient  norm  (Lapeyre  et  al.^  1999).  Conversely,  the  un¬ 
stable  orientation  C-f  corresponds  to  a  strong  decay  of 
\Vq\.  In  the  case  where  |r|  =  1,  there  is  an  algebraic 
growth  of  |Vg|^  with  time. 

“Effective  rotation”-dominated  regions 

In  regions  where  the  effective  rotation  dominates, 
|r|  >  1.  There  is  no  fixed  point  solution  to  equa¬ 
tion  (3)  and  one  has  a  nonuniform  rotation  of  gradient. 
Since  ^  is  variable  the  gradient  tends  to  spend  most  of 
its  time  near  the  direction  with  minimal  rotation  rate 
{D^C/T>t^  =  0).  The  most  probable  orientation  of  this 
direction  is  a  such  that  {Klein  et  a/.,  2000) 

a  =  arctan  +  (1  —  sign{r))^^ 

which  depends  on  another  nondimensional  parameter  s 
which  is  defined  as 


which  measures  how  rapidly  the  stirring  time  scale  a~^ 
varies  along  a  Lagrangian  trajectory.  In  such  a  situa¬ 
tion,  the  gradient  norm  presents  only  a  weak  growth  or 
decay  rate  of  —a  s/ x/r^  -}-  s^. 

The  two  nondimensional  parameters  r  and  s  suffice 
to  characterize  the  topology  of  stirring  as  well  as  the 
time  evolution  of  the  tracer  gradient  magnitude  (growth 
or  decay).  For  instance,  a  saddle  point  corresponds  to 
the  values  r  =  0  and  s  =  0,  the  axisymmetric  vortex 
flow  to  the  values  |r|  =  1  and  s  =  0  and  the  strong 
rotation  limit  to  |r|  >>  1. 

Finally,  one  can  show  that  r  and  s  depend  on  both 
the  velocity  gradient  tensor  Vu  and  on  the  acceleration 
gradient  tensor  V  ^ . 

4.  Numerical  simulations 

In  order  to  test  the  above  analytical  predictions  for 
alignment  of  the  tracer  gradient,  numerical  simulations 
of  freely  decaying  turbulence  have  been  performed  at  a 
resolution  of  (1024)^. 

Figure  8  displays  the  vorticity  field  in  a  portion  of 
the  domain  where  a  strong  vorticity  filament  is  being 
stretched  between  two  cyclonic  vortices.  The  color  code 
is  such  that  red/brown  corresponds  to  positive  vortic¬ 
ity  and  blue  to  negative  values.  The  corresponding  field 


Figure  8.  vortictiy  a; 


of  the  parametere  r  is  given  in  Figure  9,  where  green 
indicates  regions  where  the  strain  dominates,  blue  and 
red  correspond  to  regions  where  effective  rotation  dom¬ 
inates.  In  the  latter  regions,  red  corresponds  to  the 
case  when  vorticity  contributes  the  largest  part  of  the 
effective  rotation  while  blue  corresponds  to  the  opposite 
situation.  Strips  of  yellow  correspond  to  |r|  =  1. 

The  parameter  r  presents  sharp  transitions  and  also 
a  smooth  behaviour  along  the  longitudinal  direction  of 
the  filamentary  patterns.  The  vortex  cores  are  regions 
with  r  <  —1  because  of  large  u).  Their  periphery  is 
composed  of  regions  with  <  1  because  of  large  cr 
and  regions  with  r  >  1  because  of  large  ^ .  For  each 
vortex,  we  observe  opposite  signs  of  r  between  its  core 
and  the  part  on  its  periphery  where  effective  rotation 
is  strong.  In  these  regions,  uj  2^  is  dominated  by 
2^  which  is  of  opposite  sign  of  uj.  This  indicates  that 
a  characterization  of  the  stirring  properties  of  vortices 
must  take  into  account  this  rotation  rate. 

The  results  for  the  alignment  of  the  tracer  gradient 
in  strain-dominated  regions,  where  r  <  1,  are  checked 
against  the  analytical  prediction  in  Figure  10, 

which  presents  the  joint  p.d.f.  of  C  +  7r/2  and  r,  and 
the  bold  curve  is  cos  C-  The  relation  cos  C  r*  is  well 
corroborated  and  this  strongly  validates  the  analytical 
solution.  On  the  other  hand,  a  joint  p.d.f.  between 
C“h7r/2  and  a;/c7,  which  corresponds  to  the  assumption 
of  Okubo- Weiss  that  implies  ^  —  0,  does  not  present 
such  a  correlation,  and  no  alignment  occurs  for  Okubo- 
Weiss  criterion  (Figure  11).  This  further  emphasizes 
the  quantitative  importance  of  the  rotation  of  the  strain 


50 


HUA,  KLEIN,  AND  LAPEYRE 


-i 


Figure  10.  PDF  of  C  +  7r/2  and  r  = 


Figure  11.  PDF  of  ^  +  7r/2  and  uj/a  for  Okubo- Weiss 
results 


axes. 

The  results  for  the  alignment  of  the  tracer  gradient 
for  regions  where  effective  rotation  dominates,  r  >  1 
are  checked  against  the  analytical  prediction  C  ^  in 
Figure  12,  where  the  joint  p.d.f.  of  (  and  a  is  plotted. 
Again,  the  numerical  simulations  confirm  that  there  ex¬ 
ist  preferred  directions  of  alignment  of  the  tracer  gradi¬ 
ent  that  depends  on  r  and  s. 


5.  Discussion 

To  summarize,  the  dynamics  of  tracer  gradients  as  a 
function  of  the  flow  topology  can  be  captured  by  two 
regimes.  The  first  one  is  well  characterized  solely  by 


Figure  12.  PDF  of  (  and  a 


the  parameter  r 

a 

that  measures  the  competition  between  the  strain  rate 
and  the  effective  rotation  which  takes  into  account  both 
the  vorticity  and  the  rotation  rate  of  the  principal  axes 
of  the  strain-rate  tensor  along  a  Lagrangian  trajectory 
{Lapeyre  et  al,  1999).  Strong  gradient  growth  occurs 
when  |r|  <  1.  The  second  regime  needs  another  pargim- 
eter  s 

Dt 

that  measures  the  variation  of  the  stirring  time  scale 
(a“^)  along  a  Lagrangian  trajectory  (Klein  et  ai,  2000), 
and  weaker  growth/decay  of  the  tracer  gradient  occurs 
for  |r|  >  1. 

In  both  regimes,  the  flow  topology  enforces  preferred 
orientations  for  the  tracer  gradient  vector,  which  will 
depend  on  both  local  properties  of  the  velocity  gra¬ 
dient  tensor  Vu  and  also  on  the  long-range  spatial 
influence  of  the  acceleration  gradient  tensor 
(Ohkitani  and  Kishiba,  1995;  Hua  and  Klein,  1998). 
Both  Vu  and  are  entirely  diagnostic  for  quasi- 
geostrophic  dynamics  and  their  computation  only  in¬ 
volves  the  resolution  of  Poisson  problems  in  which  right- 
hand  sides  are  nonlinear  functions  of  the  streamfunction 
field  at  a  given  time  (Hua  et  al,  1998). 

These  predictions  based  on  tracer  gradient  dynamics 
have  been  applied  to  two  other  problems  which  are  also 
related  to  the  more  general  issue  of  stirring. 

The  first  case  corresponds  to  the  detection  of  in¬ 
variant  manifolds  (i.e.  the  attracting/repelling  material 
lines  associated  with  local  maxima  of  particle  dispersion 
or  equivalently  to  local  maxima  of  tracer  gradients)  by 
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computing  the  persistence  of  hyperbolicity  defined  as 


along  a  Lagrangian  trajectory  in  regions  where  |r|  <  1 
for  a  finite  time.  This  technique  enables  us  to  locate  the 
presence  of  such  manifolds  in  the  immediate  vicinity  of 
coherent  vortices,  as  well  as  in  the  “far  field”  {Lapeyre 
et  al.,  2001).  For  the  second  case,  we  have  applied  the 
analytical  prediction  of  the  time  evolution  of  the  tracer 
gradient  norm  to  the  predictability  problem  with  the 
aim  of  identifying  the  regions  of  most  rapid  growth  of 
the  dynamical  structures  {Riviere  et  al,  2001).  The 
underlying  idea  is  that  initial  perturbations  that  are  re¬ 
sponsible  for  the  rapid  growth  of  the  structures  in  a 
given  flow  can  be  viewed  as  perturbations  of  the  initial 
potential  vorticity  field  of  the  flow.  As  such,  their  dy¬ 
namics  are  closely  linked  in  physical  space  to  the  regions 
which  present  the  largest  growth  in  potential  vorticity 
gradients.  Potential  vorticity  being  a  tracer  field,  the 
above  anal3dical  predictions  can  be  used  to  identify  the 
’’sensitive”  regions  of  most  rapid  growth. 
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Abstract.  Advective  transport  in  the  ocean  fundamentally  is  Lagrangian  in  na¬ 
ture,  At  model  grid  scales,  this  transport  is  parameterized  by  eddy  or  turbulent  ex¬ 
change  coefficients  in  analogy  to  molecular  diffusion.  Nevertheless,  much  of  the 
transport  is  by  advection  on  scales  larger  than  the  grid  cell  yet  smaller  than  basin 
scale.  Moreover,  flow  at  these  scales  tends  to  be  nonstationary  in  time  and  irregu¬ 
larly  distributed  in  space.  Recently  we  have  successfully  used  concepts  from  dy¬ 
namical  systems  theory  to  analyze  observations  of  such  flows.  The  analysis  is 
based  on  constructing,  from  synoptic  observations  of  currents,  material  boundaries 
within  the  flow  field  that  are  not  apparent  in  time  varying  Eulerian  velocity  fields. 
Here  a  short  description  of  the  approach  is  given  and  an  application  to  the  Gulf  of 
Mexico  is  described  where  the  analysis  precisely  identifies  the  boundaries  of  co¬ 
herent  vortical  structures  as  well  as  pathways  for  advective  transport. 


Introduction 

In  a  perceptive  paper,  Eckart  (1948)  developed  a  para¬ 
digm  for  ocean  stirring  and  mixing  in  the  ocean.  He  hy¬ 
pothesized  this  as  occurring  in  three  stages:  the  initial 
stage  where  coherent  structures  are  formed,  a  stirring 
phase  characterized  by  local  strong  gradients  of  fluid 
properties  and  localized  but  intense  small  scale  advection, 
and  a  final  stage  in  which  small  scale  mixing  obliterates 
the  gradients  and  produces  a  uniform  distribution  of  prop¬ 
erties. 

As  Eckart’s  analysis  is  particularly  pertinent  to  the  re¬ 
sults  described  here,  it  is  appropriate  to  review  his  theory. 
The  starting  point  is  the  diffusion  equation  for  a  substance 
C: 

dCjdt  +  UjdCjdxj  =  Kd^cjdxjdxj  (1) 

Here,  the  indicial  notation  in  which  repeated  subscripts  in 
a  term  are  summed  over  the  range  of  the  spatial  domain. 
The  remaining  terms  are  standard. 

Taking  the  gradient  of  (1)  yields 

(2) 

+  dUjjdXp  {dCjdXj^^  iQ^^idCjdx p^jdx ^dx ^ 

The  last  term  on  the  LHS  is  sometimes  referred  to  as 
“frozen-in.”  Frozen-in  fields  are  not  uncommon  in  hydro¬ 
dynamics.  The  potential  vorticity  equation  is  one  widely 
recognized  example  of  a  frozen-in  field.  Evolution  equa¬ 
tions  for  fields  that  include  the  frozen-in  term  change  in 


time  like  a  differential  line  element  moving  with  the  fluid. 
Moreover,  these  terms  can  be  used  to  construct  Lagran¬ 
gian  invariants  for  the  basic  field.  Although  frozen-in 
fields  and  associated  Lagrangian  invariants  are  of  funda¬ 
mental  interest  in  the  theory  used  below,  they  are  not  pur¬ 
sued  here.  Interested  readers  are  referred  to  Kuz'min 
(1984)  for  details. 

As  is  well  known  in  theoretical  hydrodynamics,  the  ve¬ 
locity  gradient  term  in  (1)  can  be  divided  into  an  isotropic 
tensor  whose  eigenvalue  is  the  fluid  divergence,  a  trace¬ 
less  and  symmetric  deviator  (called  the  velocity  strain  rate 
or  deformation),  and  a  skew-symmetric  tensor  called  the 
spin,  which  is  one  half  the  vorticity.  These  quantities  are 
denoted  respectively  as  A.p,<I)jp,£ijp.  A^.  accounts  for 

the  change  in  volume  of  a  fluid  parcel  with  no  change  in 
its  shape  or  orientation,  O^p  describes  the  change  of  shape 

of  the  parcel  without  change  in  volume  or  orientation,  and 
£2jp  quantifies  the  changes  in  orientation  with  no  shape  or 

volume  changes- 

What  effect  does  the  frozen-in  term  have  on  uncer¬ 
tainty  in  the  gradients  of  C?  To  examine  this,  multiply  (2) 
by  dCjdx  p  and  take  the  ensemble  average.  It  is  seen  that 

the  vorticity  term  sums  to  zero  so  that  the  evolution  equa¬ 
tion  for  the  variance  of  the  gradient  C  reduces  to 
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a  <  dC/dx^  dqdx^  >ldt  + 

Ujd(^  dC/dx^  dCldx^)jdXj  + 

2A{(}qd.,pC/dx,))  + 

20,{{)C/a,,){SC/B.,))=  ® 

Kd^(dCfdx^  dCfdx^'^  jdXjdXj  - 

2K{{d=C/dx,3x,){y-Cfi,,3,,)) 

In  (3)  A  is  1/3  of  the  divergence  of  the  velocity  field  and 
the  angled  brackets  are  the  ensemble  average  operator. 
The  noteworthy  item  about  (3)  is  that  the  vorticity  n 
does  not  effect  the  evolution  of  the  variance  of  the  gradi¬ 
ent,  even  though  it  is  the  largest  of  the  gradient  terms.  The 
physical  explanation  is  that  this  merely  reorients  the  vari¬ 
ance  but  does  not  change  its  value.  By  far,  the  dominant 
frozen-in  term  in  (3)  involves  the  deformation  . 

The  heuristic  picture  that  emerges  from  this  is  that  the 
vorticity  is  the  dominant  frozen-in  term  in  (2)  while  the 
deformation  is  critical  in  (3).  Thus,  in  the  Eckart  picture, 
the  vorticity  is  important  in  large-scale  circulation  effects 
on  the  gradient  (stirring)  while  the  fluid  deformation  is 
important  in  the  final  stage  of  mixing. 

Here  some  ideas  from  dynamical  systems  theory  are 
applied  to  quantify  the  Eckart  paradigm.  The  next  section 
reviews  the  requisite  material  from  dynamical  systems. 
This  is  applied  to  a  reduced  gravity  primitive  equation 
model  in  section  3  and  then  to  interactions  of  rings  and 
the  Loop  Current  in  the  Gulf  of  Mexico. 

Review  of  dynamical  systems  theory 

The  analysis  focuses  on  those  regions  in  the  flow  field 
that  are  dominated  by  fluid  deformation  and  not  vorticity. 
Generally  these  are  small  and  ephemeral.  As  deformation 
dominates  in  these  regions,  one  expects  particle  trajecto¬ 
ries  to  move  exponentially  fast  away  from  these  regions 
or  approach  a  critical  point  in  the  flow  field.  This  is  gen¬ 
erally  the  case,  but  under  appropriate  conditions  there  is  a 
special  or  distinguished  trajectory  that  does  neither.  In  the 
dynamical  systems  literature,  such  trajectories  are  called 
distinguished  hyperbolic  trajectories  or  DHTs.  DHTs  pos¬ 
sess  an  important  flow  property,  they  are  the  seeds  for  the 
material  surfaces,  or  manifolds  in  the  dynamical  systems 
literature,  that  define  the  barriers  and  pathways  to  advec- 
tive  transport. 

Identifying  the  DHT  and  constructing  the  manifolds  is 
still  at  a  trial  and  error  state.  It  is  hoped  that  improve¬ 
ments  in  this  phase  of  the  analysis  will  soon  improve  this 
phase  of  the  analysis  so  no  attention  is  given  here  to  this 
matter.  Interested  readers  should  consult  Guckenheimer 
and  Holmes  (1983)  or  Wiggins  (1990). 


The  essential  ideas  are  contained  in  an  example  due  to 
Ide  (2000).  Consider  a  flow  field  given  by 

dx/dt  =  (y-sin/)/2 
dyjdt  =  (x-3cos/)/2 

The  particle  paths  resulting  from  this  flow  are 

x  =  (X-l)cosh(t/2)+y  sinh(t/2)’f  cost 
y  =  (X  -  l)sinh(t/2)+  Y  cosh  (t/2)--‘ sin  t 

Here  (X,  Y)  are  the  starting  positions.  Consider  first 
the  particle  that  starts  at  X=l,  Y=0.  The  trajectory  of  this 
particle  is  a  circle.  This  trajectory  differs  from  all  other 
particle  trajectories,  which  exhibit  exponential  time  be¬ 
havior.  For  example,  particles  starting  along  the  line  y=l- 
X  will  flow  towards  the  DHT  as  it  executes  its  circular 
trajectory.  All  other  particles  will  flow  away  from  the 
DHT  along  the  line  y=x-l .  The  lines  y  =  ±  (x  - 1)  define 
the  inflowing  and  outflowing  manifolds  for  this  example. 

It  is  important  to  note  that  particles  cannot  cross  the 
quadrants  delineated  by  the  eigen-directions  since  they 
are  asymptotes  for  the  trajectories.  Thus,  the  curves  along 
the  eigen-directions  are  the  manifolds.  The  inflowing 
manifold  is  along  the  northwest-southeast  direction  and 
the  outflowing  manifold  is  along  the  other  eigen- 
direction.  Although  the  deformation  is  everywhere  con¬ 
stant,  locally  the  manifolds  emanate  from  the  DHT  and  so 
they  move  in  space  and  time. 

In  natural  settings  one  expects  a  multitude  of  DHT 
within  a  domain.  Thus,  one  could  expect  a  complicated 
web  of  intersecting  inflowing  and  outflowing  manifolds 
associated  with  different  DHT.  Such  intersections  are 
called  principal  intersection  points  or  PIP.  Particles  flow¬ 
ing  along  an  outflowing  manifold  will  transfer  to  the  in¬ 
flowing  manifold  at  a  PIP.  Examples  of  this  are  demon¬ 
strated  below. 

Manifolds  in  a  reduced  gravity  primitive 
equation  model 

Here  a  few  examples  of  DHT  and  manifolds  are  given 
for  a  reduced  gravity  primitive  equation  model.  The 
model  domain  is  2000  km  on  a  side  and  the  forcing  is  by 
a  meridianal  wind  stress.  See  Poje  and  Haller  (1999)  for 
details. 

Figure  1  shows  the  flow  field  with  several  manifolds 
and  DHT.  Note  that  the  two  eddies  north  of  the  jet  are 
connected  by  manifolds.  This  implies  some  mass  ex¬ 
change  between  these  structures.  Also  note  the  manifolds 
bounding  the  jet.  These  indicate  that  there  is  no  advective 
transport  across  the  jet  in  those  regions.  In  fact,  the  only 
communication  between  the  waters  on  either  side  of  the 
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jet  are  through  manifolds  that  develop  when  eddies  are 
shed. 


DOUBLE  GYRE  MANIFOLDS:  DAY  51 


KM 

Figure  1 .  Flow  field  with  several  manifolds  and  DHT. 

Figure  2  depicts  an  example  of  the  power  of  manifolds 
to  channel  advective  stirring.  Two  blobs  are  started  in 
about  50  kilometers  apart.  The  one  just  to  the  east  is  ini¬ 
tialized  around  a  DHT  so  it  is  expected  to  be  stretched 
along  the  outflowing  manifold.  The  other  blob  is  initial¬ 
ized  near  a  principal  intersection  point  or  the  intersection 
of  an  inflowing  manifold  of  one  DHT  and  the  outflowing 
manifold  of  another  DHT.  It  is  attracted  nearly  equally  to 
both  and  so  the  stretching  effects  are  canceled.  The  result 
is  that  this  blob  is  advected  parallel  to  the  outflowing 
manifold  but  with  negligible  stretching. 
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DOUBLE  GYRE;  DAY  44 


Figure  2.  Manifolds  that  channel  advective  stirring. 


Interactions  of  rings  with  the  Loop  Current 

The  purpose  of  this  section  is  to  apply  the  dynamical 
systems  methods  illustrated  above  to  ocean  data.  To  do 
this  we  use  the  University  of  Colorado  model  of  the  Gulf 
of  Mexico.  This  is  a  24-layer  primitive  equation  model 
with  1/12  degree  resolution,  realistic  bottom  topography, 
and  modified  Mellor  Yamada  turbulence  closure.  It  also  is 
run  in  a  predictive  mode  assimilating  altimeter  data, 
ABTs,  and  NOGAPS  winds.  For  more  details  on  the 
model  and  an  assessment  of  its  performance  see  Toner  et 
al  (2000), 

Figure  3  shows  a  time  slice  of  the  model  sea  surface 
height  and  currents  at  50  meters.  Attention  is  directed  to 
the  eastern  portion  of  the  model  domain  where  the  Y uca- 
tan  Current  enters  the  Gulf  and  makes  a  large  loop  before 
egressing  through  the  Straits  of  Florida.  To  the  west  of  the 
Loop  Current  is  a  large  anticyclonic  ring  that  was  shed 
several  months  before,  and  to  the  east  is  the  Tortugas 
Eddy,  a  large  semi-permanent  cyclone  sitting  near  the 
western  most  Florida  Key.  Kiitnetzkov  et  al  (2000)  pro¬ 
vide  more  details  of  the  flow  field. 


GULF  OF  MEXICO  (SOM):  DAY  101.00 


rents  at  50  m. 

Four  DHTs  were  located  in  this  region  and  inflowing 
and  outflowing  manifolds  were  calculated  for  each.  These 
are  shown  in  Figures  4  and  5.  It  is  important  to  note  that 
the  manifolds  provide  precise  (but  not  necessarily  accu¬ 
rate)  delineations  of  the  boundaries  of  the  coherent  struc¬ 
tures  noted  above.  Since  they  represent  boundaries  to  ad¬ 
vective  transport  the  manifolds  provide  a  quantitative 
measure  of  the  size  and  shape  of  these  structures. 
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Figure  4.  Inflowing  and  outflowing  manifolds  calculated  for 
four  DHTs. 
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Figure  5.  Evolution  of  a  small  cyclone  originally  to  the  north¬ 
east  of  the  large  anticyclonic  ring. 


Figure  5  shows  the  evolution  of  a  small  cyclone  origi¬ 
nally  to  the  northeast  of  the  large  anticyclonic  ring.  As 
seen  in  this  figure  the  cyclone  moves  anticyclonically 
around  the  ring  and  actually  cuts  off  approximately  20%. 
This  phenomenon  was  missed  in  the  original  examination 
of  the  Eulerian  model  velocities  and  height  fields.  The 
Lagrangian  analysis  used  here  highlighted  this  event. 

Summary  and  discussion 

This  analysis  is  based  on  concepts  from  dynamical  sys¬ 
tems  theory.  The  basic  ingredients  arc  hyperbolic  points 
or  saddle  points  and  DHTs.  The  former  is  characterized 
by  real  eigenvalues  of  the  velocity  gradient  at  stagnation 
points.  They  have  special  significance  in  this  approach  in 
that  they  often  signify  a  nearby  DFIT.  These  latter  objects 
serve  as  seeds  for  calculating  inflowing  and  outflowing 
manifolds  or  material  boundaries  that  determine  the  ad- 
vective  transport  pathways  through  a  maze  of  coherent 
flow  structures.  In  complex  flows,  such  as  the  Loop  Cur¬ 
rent  region  of  the  Gulf  of  Mexico,  manifolds  connect  re¬ 
gions  of  intense  mixing,  in  the  language  of  Eckarf  (1948), 
the  manifolds  delineate  stirring  regimes  and  connect  re¬ 
gions  of  intense  mixing. 

These  pathways  have  now  been  found  in  simple  time 
dependent  QG  and  primitive  equation  models  and  in  data 
assimilative  GCMs  1992).  Additionally,  these 

techniques  have  been  applied  to  merged  drifters,  moor¬ 
ings  and  MOD  AS  data  set  on  the  LATEX  Shelf  {Schulz, 
1999).  It  seems  then  they  arc  telling  us  that  stirring  and 
mixing  in  the  ocean  may  be  highly  time-  and  space- 
dependent  and  arc  governed  by  relatively  simple  proc¬ 
esses  determined  by  Lagrangian  analysis.  However,  quan¬ 
tification  of  the  importance  of  manifolds  and  DHTs  to 
stirring  and  mixing,  relative  to  traditional  exchange  proc¬ 
esses,  will  have  to  await  extension  of  these  methods  to 
three  dimensions. 

A  comment  on  the  stretching  of  blobs  when  they  ap¬ 
proach  a  DHT  along  an  inflowing  manifold  is  appropriate. 
The  analysis  indicates  that  a  blob  may  be  stretched  hun¬ 
dreds  of  kilometers  along  the  outflowing  manifold,  a 
purely  advective  process.  We  know  of  no  observational 
evidence  for  this.  Rather,  it  seems  more  likely  that  the 
blob  undergoes  intense  small  scale  mixing  and  may  lose 
its  identity.  But,  both  the  reduced  gravity  primitive  equa¬ 
tion  model  and  the  assimilative  model  of  the  Gulf  of  Mex¬ 
ico  include  sophisticated  and  well-timed  mixing  parame- 
terizations.  That  the  models  are  unable  to  predict  the 
likely  evolution  of  the  blob  in  near  a  DHT  suggests  a  de¬ 
ficiency  in  the  present  generation  of  mixing  parameteriza- 
tions. 
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Prom  stirring  to  mixing  of  momentum:  cascades  from 
balanced  flows  to  dissipation  in  the  oceanic  interior 

James  C.  McWilliams^,  M.  Jeroen  Molemaker^,  and  Irad  Yavneh^ 


Abstract.  Under  the  influences  of  stable  density  stratification  and  Earth’s 
rotation,  large-scale  flows  in  the  ocean  and  atmosphere  have  a  mainly 
balanced  dynamics — sometimes  called  the  slow  manifold — in  the  sense  that 
there  are  diagnostic  hydrostatic  and  gradient-wind  balances  that  constrain 
the  fluid  acceleration.  The  nonlinear  balance  equations  are  a  successful 
approximate  model  for  this  regime,  and  we  have  identified  mathematically 
explicit  limits  of  their  tune  integr ability.  We  hypothesize  that  these  limits 
are  indicative,  at  least  approximately,  of  the  transition  from  the  larger-scale 
regime  of  inverse  energy  cascades  of  anisotropic  flows  to  the  smaller-scale 
regimes  of  forward  energy  cascade  to  dissipation  of  more  nearly  isotropic 
flows  and  intermittently  breaking  inertia-gravity  waves.  In  the  oceans  these 
regime  transitions  occxu  mostly  in  the  scale  range  of  0.1-10  km— in  between 
the  mesoscale  and  fine-structure — where  Rossby  (Ro),  Proude  (Fr),  and 
Richardson  (Ri)  numbers  are  typically  neither  small  nor  large.  In  pursuit  of 
testing  this  hypothesis  we  have  revisited  several  classical  problems,  including 
gravitational,  centrifugal/syinmetric,  elliptical,  barotropic,  and  baroclinic 
instabilities.  In  all  cases  we  find  definite  evidence,  albeit  still  incompletely 
imderstood,  of  fluid-dynamical  transitions  in  the  neighborhood  of  loss  of 
balanced  integrability. 

Introduction 

The  general  circulation  of  the  ocean  is  forced  by  sur¬ 
face  fluxes  of  heat,  water,  and  momentum  primarily  at 
large  space  and  long  time  scales.  The  circulation  has 
comparably  large  and  long  scales,  as  well  as  important 
smaller  ones  associated  with  equatorial  zonal  and  lat¬ 
eral  boundary  currents  and  with  the  dominant  insta¬ 
bility  modes  at  mesoscales.  All  of  these  circulation  el¬ 
ements  approximately  satisfy  geostrophic,  hydrostatic, 
and  incompressible  dynamical  balances. 

How  does  the  energy  dissipation  occur  for  the  gen¬ 
eral  circulation  in  an  equilibrium  balance  with  the  gen¬ 
eration  by  surface  fluxes?  Some  of  the  dissipation  im- 
doubtedly  occurs  within  turbulent  boimdary  layers  near 
the  surface  and  bottom.  Some  dissipation  also  occurs 
through  creation  of  internal  gravity  waves  by  flow  over 
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topography,  with  subsequent  wave  propagation  into  the 
interior  and  a  wave-dynamical  cascade  (sometimes  in¬ 
volving  breaking)  down  to  dissipation  at  small  scales. 
Each  of  these  routes  to  dissipation  involves  an  extrac¬ 
tion  of  energy  from  the  circulation  near  the  vertical 
boxmdaries,  although  the  bulk  of  the  energy  resides 
in  the  vertical  interior.  A  more  local  route  is  directly 
through  the  interior,  turbulent  cascade  dynamics  of  the 
circulation.  In  oceanic  general  circulation  models,  the 
local  route  to  dissipation  is  implied  by  the  use  of  eddy 
diffusivities  to  parameterize  this  cascade.  The  purpose 
of  this  article  is  to  examine  the  mechanism  for  the  local 
route  to  dissipation,  without  here  trying  to  assess  the 
relative  contributions  among  these  alternative  routes. 

Our  conceptual  view  of  the  mechanism  is  the  follow- 
ing: 

Large-  and  mesoscale  circulations  typically  satisfy 
a  balanced  dynamics  (as  defined  below),  which 
have  little  interaction  with  the  inertia-gravity  wave 
field; 
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balanced  turbulent  cascades  are  very  inefficient  in 
energy  dissipation; 

there  are  explicitly  specifiable  limits  to  the  regime 
of  balanced  d3mamics  that  are  violated  sometimes 
for  the  circulation; 

violation  of  these  limits  leads  to  energy  transfer 
to  unbalanced  motions; 

turbulent  cascades  are  much  more  efficient  in  their 
dissipation. 

In  this  view  the  important  bottleneck  in  the  local 
route  to  dissipation  is  loss  of  balance  and  its  evolution¬ 
ary  consequences. 

Confirming  or  refuting  this  hypothesis  is  quite  chal¬ 
lenging,  since  it  involves  the  connectedness  of  turbulent 
cascades  spanning  several  dynamical  regimes.  Even  di¬ 
agnosis  of  the  degree  of  balance  can  be  subtle.  In  lieu  of 
making  a  more  general  test  of  the  hypothesis  as  yet,  here 
we  focus  on  the  special  situation  of  the  linear  instability 
of  balanced  steady  currents  in  a  rotating,  stratified  fluid 
in  relation  to  the  conditions  for  loss  of  balance.  Since 
fluid  instabilities  have  been  the  subject  of  much  prior 
research,  we  will  tell  the  story  from  both  historical  per¬ 
spectives  of  early  discovery  and  personal  perspectives 
of  the  imphcations  for  the  hypothesis  above — skipping 
over  most  of  the  literature  in  between. 

Balanced  Dynamics 

The  essential  basis  for  the  approximations  of  bal¬ 
anced  dynamics  is  velocity  anisotropy.  In  a  rotating, 
stratified  fluid  with  Coriolis  frequency  /  and  Brunt- 
Vmsala  frequency  N  and  away  from  boimdaries,  the 
evolution  from  general  initial  conditions  or  forcing  by 
the  process  called  geostrophic  (or  balanced)  adjustment 
leads  to  a  local  anisotropy  with  u,  v  >  u;,  while  radi¬ 
ating  away  inertia-gravity  waves.  Here  z  and  w  are  the 
coordinate  and  velocity  components  in  the  vertical  di¬ 
rection,  antiparallel  to  gravity,  and  (x,  y)  and  (u,  v)  are 
their  horizontal  counterparts.  The  condition  for  this  to 
occur  are  that  the  Rossby  and  FVoude  numbers, 

Ro  =  V/fL  and  Fr  =  V/NH,  (1) 

are  not  too  large  (where  V ,  and  L  are  characteristic 
values  for  (u,v),  z,  and  (x,  y)).  Under  these  conditions 
the  vertical  momentum  balance  is  approximately  hydro¬ 
static, 

<t>z  «  b,  (2) 

(<^  =  p/po  is  the  geopotential  function,  6  =  yfl-p/pj 
is  the  buoyancy,  and  po  is  the  mean  density);  the  diver- 
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gence  of  the  horizontal  momentum  balance  is  approxi¬ 
mately 

VV  «  ^  +  2[V»x*^j,y  -  (3) 

which  is  called  gradient-wind  balance  (V  is  the  hori¬ 
zontal  gradient  operator);  and  the  horizontal  velocity  is 
weakly  divergent  and  thus  can  be  approximately  repre¬ 
sented  by  a  streamfunction, 

u  —‘ipy  and  V  rs  (4) 

The  maximal  truncation  of  the  incompressible  (Boussi- 
nesq)  equations  consistent  with  these  approximations 
and  conservation  of  either  energy  (in  Cartesian  co¬ 
ordinates,  {x,y,z))  or  potential  enstrophy  (in  isopyc- 
nal  coordinates,  (X,Y,b))  is  called  the  balance  equa¬ 
tions  {Lorenz  [I960];  Gent  and  McWilliams  [1984]). 
(Many  alternative  models  have  been  proposed  for  bal¬ 
anced  dynamics;  among  the  better  ones,  their  similari¬ 
ties  seem  more  important  than  their  differences.)  The 
balance  equations  contain  no  inertia-gravity  wave  solu¬ 
tions;  so  they  are  often  taken  as  a  dynamical-systems 
model  for  the  (advective)  slow  manifold.  They  have 
second-order  asymptotic  accuracy  as  Ro  Fr  0, 
whereas  the  traditional  geostrophic  balance  and  quasi- 
geostrophic  equations  have  only  first-order  accuracy 
{Gent  and  McWilliams  [1983]).  They  have  been  shown 
in  many  analyses  to  accurately  represent  the  observed 
state  and  evolution  of  large-scale  flows  in  the  atmo¬ 
sphere  and  ocean;  for  example,  they  often  are  used  for 
initialization  of  weather  forecasts,  even  ones  for  hur¬ 
ricanes.  An  important  aspect  of  this  accuracy  is  the 
weakness  of  inertia-gravity  wave  generation  by  balanced 
motions  when  Ro  and  Fr  are  not  large. 

The  advective  dynamics  of  balance  or  quasigeostro- 
phic  equations — called  geostrophic  turbulence — yields 
an  inverse  turbulent  cascade  of  energy  towards  larger 
scales  in  (x,  y,  z),  hence  away  from  dissipation  by  molec¬ 
ular  viscosity  at  small  scales,  and  a  forward  cascade  of 
potential  enstrophy  (i.e.,  variance  of  potential  vortic- 
ity)  to  its  dissipation  at  small  scales  {Chamey  [1971]; 
McWilliams  et  al.  [1994]).  (This  behavior  is  analogous 
to  the  turbulence  in  a  two-dimensional  fluid.)  In  the 
enstrophy  inertial  range,  Ro  and  Fr  are  invariant  as 
the  scale  decreases,  at  least  in  the  limit  of  Ro,  Fr  0. 
There  is  as  yet  much  less  experience  with  balanced  tur¬ 
bulence  at  finite  values  of  Ro  and  Fr,  but  available 
solutions  indicate  that  it  is  only  modestly  more  dissipa¬ 
tive  of  energy  ( Yavneh  et  ai  [1997]).  It  remains  an  open 
question  how  consistently  Ro  and  Fr  avoid  increasing  in 
the  forward  cascade,  either  in  the  balance  equations  or 
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more  fundamental  fluid  d3aiamics:  at  small  scales  does 
the  cascade  in  balanced  turbulence  generate  inconsis¬ 
tencies  with  its  justifying  assumptions  and  how  leaky 
is  the  slow  manifold  to  unbalanced  motions?  Never¬ 
theless,  our  present  imderstanding  is  that  the  balance 
equations  do  not  provide  an  efficient  route  to  energy  dis¬ 
sipation  away  from  boundaries:  they  imply  more  stir¬ 
ring  than  mixing  for  momentum. 

Loss  of  Balance 

An  analysis  for  the  solvability  of  the  balance  equa¬ 
tions  is  made  in  Yavneh  et  al  [1997]  and  McWilliams 
et  al.  [1998],  To  be  able  to  integrate  forward  in  time 
from  a  balanced  state,  several  necessary  conditions  must 
be  satisfied  everywhere  within  the  domain.  Where  these 
are  violated,  there  is  a  change  of  type  of  the  partial 
differential  system  and  the  initial-  and  boimdary- value 
problem  becomes  ill-posed.  For  the  balance  equations 
in  isopycnal  coordinates  (i.e.,  Gent  and  McWilliams 
[1984])  and  /  >  0,  the  conditions  for  loss  of  balance 
are  the  following: 

1.  Change  of  sign  of  vertical  stratification,  N^  = 

db. 

dz^ 

2.  Change  of  sign  of  absolute  vorticity,  A  =  f  -\- 

^(z)  _  (where  the  horizontal  deriva¬ 

tives  denoted  by  capital  letters  are  in  isentropic 
coordinates); 

3.  Change  of  sign  of  A  —  |5|  (where  5^  ==  {ux  “ 
vy)^  {vx  +  uy)^  is  the  variance  of  the  strain 
rate). 

None  of  these  conditions  occurs  in  the  quasigeostrophic 
limit,  since  A,A-\S\  f  +  0{Ro)  and  JV  -h 
O(jRo),  where  No{z)  is  the  resting-state  stratification. 
Note  the  greater  susceptibility  of  anticyclonic  regions 
(i.e.,  with  C^^V/  <  0)  tfi®  second  and  third  con¬ 
ditions;  furthermore,  note  the  greater  susceptibility  to 
the  third  condition,  since  A  -  |5|  <  A.  The  first  and 
second  conditions  also  axe  related  to  the  potential  vor¬ 
ticity,  q  =  AN^.  Since  potential  vorticity  is  conserved 
on  parcels,  except  for  mixing  effects,  there  is  thus  an 
evolutionary  inhibition  for  an  unforced  flow  to  spon- 
taneotisly  develop  a  violation  of  the  first  and  second 
conditions.  However,  there  is  no  such  constraint  with 
respect  to  the  third  condition,  which  indicates  another 
sense  in  which  there  may  be  a  greater  susceptibility  to 
the  third  condition. 


Instability  and  Cascade 

We  now  ask  what  happens  when  there  is  a  loss  of 
balance  as  defined  above.  Obviously  any  further  in¬ 
tegration  of  the  balance  equations  is  precluded.  So 
the  question  must  be  answered  in  a  more  fundamental 
fluid  djoiamics,  such  as  the  incompressible  Boussinesq 
equations,  which  have  no  restriction  on  Ro  and  Fr  for 
their  validity.  In  general,  our  expectation  is  that  some 
inertia-gravity  waves  and/or  more  nearly  isotropic  tur¬ 
bulence  will  be  generated — instigating  some  degree  of 
enhanced  dissipation — where  there  is  a  loss  of  balance. 
However,  the  efficiency  of  their  generation  is  uncertain, 
as  is  whether  the  subsequent  evolution  systematically 
departs  from  balanced  dynamics  or  relaxes  back  to¬ 
wards  it  (e.g.,  as  a  geostrophic  adjustment  or  selective 
cascade  and  dissipation  processes).  In  the  rest  of  this 
article,  we  address  the  issue  of  generation  and  subse¬ 
quent  evolution  only  in  a  very  particular  context,  viz., 
the  linear  instabilities  of  rotating,  stratified  flows  which 
are  steady,  inviscid,  balanced  solutions  of  the  Boussi¬ 
nesq  equations.  While  this  is  far  from  the  general  cir¬ 
cumstances  of  fluid  evolution,  it  does  provide  a  cleanly 
posed  question  that  also  is  one  that  can  be  answered 
in  part  by  reference  to  the  extensive  literature  on  fluid 
instabilities. 

Instability  Types 

Now  we  attempt  to  interpret  the  known  instabilities 
in  relation  to  the  conditions  for  loss  of  balance  for  ro¬ 
tating,  stratified  flows  where  Ro  and  Fr  are  not  large. 
It  is  probably  unprovable  that  any  such  taxonomy  of 
instability  types  can  be  complete  and  imique;  however, 
after  all  the  research  that  has  gone  into  this  topic,  the 
landscape  has  become  fairly  well  mapped. 

Quasigeostrophic  Inflectional  Instability 

Consider  the  instability  of  a  geostrophic  parallel 
flow  U{y,z)  with  background  stratification  No{z)  and 
Coriolis  frequency  f{y)  in  the  quasigeostrophic  limit, 
iio,  Fr  — >  0.  Following  Rayleigh  [1880]  and  Drazin  and 
Howard  [1966]  and  ignoring  vertical-boundary  effects, 
one  can  derive  a  “Rayleigh  theorem”  from  the  potential 
vorticity  equation  that  a  necessary  condition  for  inviscid 
instability  of  a  non-symmetric  (i.e.,  dx  ^  0),  normal¬ 
mode  fluctuation  is  that  the  mean  potential  vorticity 
gradient, 
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change  sign  within  the  domain.  This  is  called  an 
inflection-point  instability  since  it  involves  the  hori¬ 
zontal  and/or  vertical  curvature  of  U;  depending  upon 
which  curvature  is  dominant,  the  instability  is  labeled 
barotropic  or  barocHnic  instability.  The  unstable  mode 
is  itself  geostrophically  balanced.  This  is  the  only  type 
of  instability  that  occurs  in  the  quasigeostrophic  limit. 
It  has  an  analytic  continuation  to  finite  Ro  and  Fr, 
where  it  can  be  expected  to  remain  balanced  over  some 
range  of  these  parameters.  Thus,  its  onset  conditions 
have  nothing  to  do  with  the  limits  of  balance,  and  it 
represents  a  mode  of  stirring  within  balanced  dynam¬ 
ics. 

Gravitational  Instability 

The  condition  for  the  onset  of  gravitational  instabil¬ 
ity  in  the  limit  of  vanishing  viscosity  is  AT^  <  o  (Rayleigh 
[1916];  Chandrasekhar  [1961]).  This  coincides  with  the 
first  condition  for  loss  of  balance,  and  the  mode  of  insta¬ 
bility  is  unbalanced  (e.g.,  the  vertical  momentum  bal¬ 
ance  is  non- hydrostatic). 

Symmetric  Centrifugal  Instability 

For  a  balanced,  parallel  flow  U{y,z),  the  neces¬ 
sary  and  sifificient  condition  for  inviscid  instability  of 
a  parallel-S)Tnmetric  perturbation  is  a  change  of  sign 
of  potential  vorticity  q{y,z)  (Hoskins  [1974]).  This  co¬ 
incides  with  the  second  condition  for  loss  of  balance. 
(This  is  a  2D  problem  rather  than  a  3D  one,  and 
McWilliams  et  al.  [1998]  show  that  the  third  condition 
is  not  germane  in  this  situation.)  For  a  balanced,  ax- 
isymmetric,  azimuthal  flow  U(r,z),  the  conditions  for 
the  inviscid  instability  of  an  axisymmetric  perturbation 
are  the  change  of  sign  of  either  the  absolute  vorticity 
A  or  the  absolute  circulation  C  =  -fr  4-  U  (Rayleigh 
[1916];  Ooyama  [1966]).  McWilliams  et  al.  [1998]  show 
that  these  coincide  with  the  second  or  third  conditions, 
respectively,  for  loss  of  balance  in  this  case.  Thus,  the 
boundaries  for  onset  of  symmetric  centrifugal  instabil¬ 
ity,  which  has  unbalanced  growing  modes,  occur  exactly 
at  the  limits  of  balanced  evolution. 

Elliptical  Instability 

The  inviscid  instability  of  the  balanced,  elliptical, 
two-dimensional,  barotropic  flow  in  an  unbounded  do¬ 
main, 

,  1  >  ^  >  0,  (6) 

was  originally  analyzed  for  /  =  AT  =  0  (Pierrehumbert 
[1986];  Bayly  [1986];  Craik  and  Criminale  [1986])  and 


Figure  1.  Growth  rate,  o/f,  for  the  elliptical  flow  (6), 
maximized  over  vertical  wavenumber.  Curves  are  shown  for 
P/a  =  0.25  (dashed),  0.11  (dotted),  and  0.026  (solid).  The 
corresponding  abscissa  values  for  the  second  condition  for 
loss  of  balance  are  -0.6,  -0.8,  and  -0.95  (see  text). 


(A-lSi)/f 


Figure  2.  As  figure  1,  but  with  logarithmic  ordinate.  The 
evident  noise  is  because  of  intermittent  underestimates  of  a 
due  to  incomplete  optimization  searches  over  the  very  nar¬ 
row  unstable  bands  in  vertical  wavenumber. 
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later  extended  to  Ro,Fr  <  oo  by  Miyazaki  [1993]. 
The  unstable  modes  occur  in  bands  of  the  vertical 
wavenumber.  They  have  temporally  oscillatory  hori¬ 
zontal  wavenumbers  and  exhibit  exponential  growth  av¬ 
eraged  over  a  wavenumber  oscillation  period. 

The  problem  was  revisited  in  McWilliams  and  Yavneh 
[1998]  from  the  perspective  of  loss  of  balance:  ellipti¬ 
cal  instability  disappears  in  the  quasigeostrophic  limit 
and  its  onset  nearly,  but  not  precisely,  coincides  with 
the  third  condition  for  loss  of  balance  (see  Figures  1- 
2).  The  unstable  mode  is  unbalanced.  For  this  basic 
flow  the  vorticity  and  strain  rate  are  spatially  uniform 
and  imequal  in  magnitude.  The  abscissa,  (A  -  \S\)/f, 
=  +1  in  the  quasigeostrophic  limit,  =  0  at  the  third 
condition  for  loss  of  balance,  =  -{a  -  /3){a  +  0)  at 
the  second  condition  for  loss  of  balance,  and  >  -f-l  for 
cyclonic  flows.  Cyclonic  elliptical  flows  are  stable  for 
CC./3  =  Oif), 

Taylor- Couette  and  Barotropic  Instabilities 

Consider  Taylor-Couette  flow  in  the  gap  between  two 
axisymmetric,  rotating  cylinders, 

U{r)  =  Ar  +  j,  (7) 

which  is  a  viscous  steady  solution  commonly  studied 
in  laboratory  experiments.  The  classical  instabihty  for 
this  flow  is  centrifugal  (Taylor  [1923];  Chandrasekhar 
[1961]),  whose  onset  in  the  inviscid  limit  coincides  with 
the  second  condition  for  loss  of  balance.  In  a  neu¬ 
trally  stratified  fluid,  this  is  the  only  type  of  linear  in¬ 
stability,  since  this  profile  does  not  have  an  inflection 
point.  However,  for  a  stably  stratified  fluid  with  small 
Fr,  this  barotropic  shear  flow  has  another  class  of  im¬ 
balanced  instabilities  at  finite  Ro  in  the  anticyclonic 
regime  (Molemaker  et  al  [2001];  Yavneh  et  al.  [2001]), 
but  there  is  not  any  instability  for  the  quasigeostrophic 
limit,  Ro  0,  nor  for  cyclonic  flows  with  Ro  =  0(1). 
For  this  new  class  an  infinite  but  countable  set  of  un¬ 
stable  modes  exist,  which  differ  in  their  cross-stream 
structure,  each  with  a  different  narrow  band  of  verti¬ 
cal  wavenumbers.  In  Figures  3-4,  the  growth  rates  are 
shown  for  the  first  three  modes  in  the  thin-gap  limit 
where  and  S  are  nearly  uniform  and  equal  in  mag¬ 
nitude.  As  with  elliptical  flow,  the  strength  of  the  in- 
stabihty  strongly  increases  in  the  neighborhood  of  the 
third  condition  for  loss  of  balance.  (And,  in  this  case, 
(A- 15|)/ /  w  — 1  corresponds  to  the  second  condition.) 
The  instability  can  be  shown  to  involve  a  resonance  of 
shear-modified  neutral  modes  (as  required  in  any  eigen¬ 
value  problem  whose  eigenfrequency  is  either  real  or  a 


Figure  3.  Growth  rate,  cr/f^  for  Taylor-Couette  flow  (7)  in 
a  narrow  gap,  maximized  over  vertical  wavenumber.  Curves 
are  shown  for  the  three  gravest  modes  in  cross-stream  struc¬ 
ture. 

complex-conjugate  pair);  for  the  gravest  unstable  mode 
the  resonance  involves  a  pair  of  Kelvin  waves  propagat¬ 
ing  cyclonically  along  each  cylinder  wall,  and  for  the 
other  modes,  one  of  the  Kelvin  modes  is  replaced  by 
an  inertia-gravity  mode.  Thus,  the  instability  has  an 
unbalanced  djoiamics.  An  explicit  formula  can  be  ob¬ 
tained  for  the  unstable  growth  rate  of  the  gravest  mode, 

asymptotically  as  Ro  =  max|C^*^|//  -t  O"*"  and 
X  =  (A  -  \S\)/f  -4  1“  (with  7  =  2  (analytically) 
for  the  gravest  mode  and  7  «  3  (computationally)  for 
the  higher  modes;  n.b.,  X  is  the  abscissa  in  Figures  3- 
4).  This  asymptotic  regime  is  accmately  realized  even 
in  the  neighborhood  of  the  third  condition  for  loss  of 
balance,  which  implies  that  there  is  an  extremely  rapid 
weakening  of  a  with  Ro  and  X  in  this  neighborhood, 
but  not  an  abrupt  cessation  at  any  critical  value  near 
X  =  0. 

Recently  we  have  also  solved  the  linear,  inviscid  sta¬ 
bility  problem  for  a  barotropic  boundary  current  along 
a  straight  coastline  (at  x  =  0), 

V{x)  =  Voexp{-a  x),  (9) 

in  a  uniformly  rotating,  stratified  fluid  in  a  semi-infinite 
domain  (i.e.,  x  >  0).  This  profile  also  does  not  have  an 
inflection  point.  In  addition  to  no  normal  flow  at  the 
boundary,  we  prescribe  a  radiation  condition  at  a  sufiB- 
ciently  distant  location  in  the  interior  by  matching  the 
solution  to  an  outwardly  radiating,  free  inertia-gravity 
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Figure  4.  As  figure  3,  but  with  logarithmic  ordinate 


Figure  5.  Growth  rate,  cr/f,  for  the  pzLrallel  boundary 
current  (9),  maximized  over  along-stream  and  vertical 
wavenumbers.  The  abscissa  value  is  based  upon  the 
minimum  over  x  which  occurs  at  x  =  0. 


wave, 


du 

dx 


=  iku, 


(10) 


where  fc  is  a  cross-stream  wavenumber  determined  from 
vertical  and  along-stream  wavenumbers  and  eigenfre- 
quency  using  the  dispersion  relation  of  an  inertia-gravity 
wave.  Again  there  are  imbalanced  unstable  modes  for 
anticyclonic  flows  (i.e.,  Vo/ f  >0)  away  from  the  quasi- 
geostrophic  limit.  In  contrast  with  Taylor- Couette  flow, 
with  its  discrete  spectrum  of  modes  in  the  cross-flow 
direction,  here  there  is  a  continuous  set  of  unstable 
modes  for  any  along-stream  and  vertical  wavenumber 
pair  (and  its  corresponding  cross-stream  k).  In  Fig¬ 
ures  5-6,  growth  rates  are  shown  for  optimal  along-flow 
and  vertical  wavenumbers,  indicating  yet  again  a  rapid 
increase  of  the  growth  rate  in  the  vicinity  of  the  third 
condition  for  loss  of  balance;  the  functional  form  of 
(t[Ro]  here  appears  to  be  close  to  that  for  the  higher¬ 
mode,  Taylor-Couette  instabilities.  Again,  a  resonance 
can  be  diagnosed  with  a  shear-modified  Kelvin  mode 
and  an  inertia-gravity  mode. 

In  summary,  the  three  different  barotropic,  anticy¬ 
clonic,  rotating,  stratified,  shear  flows  analyzed  in  this 
and  the  preceding  section  all  have  an  unbalanced  insta¬ 
bility  whose  strength  rapidly  increases  in  the  vicinity 
of  the  third  condition  for  loss  of  balance  (in  addition 
to  the  unbalanced,  centrifugal  instability  in  the  vicin¬ 
ity  of  the  second  condition).  In  the  hydrostatic  limit, 
this  instability  is  related  to  ageostrophic,  parallel,  anti¬ 
cyclonic  flow  instability  in  a  shallow-water  layer  (Sato- 
mura  [1981];  Griffiths  et  al.  [1982],  et  seq.) 


Baroclinic  and  Kelvin-Helmholtz  Instabilities 

Consider  a  steady,  spatially  uniform  vertical  shear 
flow,  U{z)  oc  z,  in  a  uniformly  stratified  and  rotating 
fluid  in  a  vertically  bounded  domain  with  non-isopycnal 
boundaries.  We  define  the  Richardson  number  for  this 
flow  by  Ri  =  N^/U^,  which  is  equivalent  to  a  Fr-^. 
The  balancing  buoyancy  field  is  B(j/,  z)  =  N^z-fU^y, 
and  the  associated  quantities  in  the  second  and  third 
conditions  for  loss  of  balance  are 


A 


A-S 


(11) 


Notice  that  this  flow  regime  is  anticyclonic  since  A//  < 
1.  Thus,  we  can  make  the  following  categorization  for 
this  flow  in  terms  of  Ri  and  in  relation  to  the  conditions 
for  loss  of  balance: 


Figure  6.  As  figiire  5,  but  with  logarithmic  ordinate 


•  The  quasigeostrophic  limit  occurs  for  Ri  ^  oo.  It 
has  a  geostrophic  baroclinic  instability  {Chamey 
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[1947];  Eady  [1949]),  whose  “inflection  point”  in 
this  particular  case  occurs  at  the  vertical  bound¬ 
aries  (see  above). 

•  The  third  condition  is  satisfied  if  Ri  <  2. 

•  The  second  condition  is  satisfied  if  iJz  <  1. 

•  The  classical  (non-rotating)  condition  for  the  on¬ 
set  of  Kelvin-Helmholtz  instability  {Miles  [1961]; 
Howard  [1961])  is  satisfied  if  Ri  < 

•  The  first  condition  is  satisfied  if  Ri  <  0;  this  is 
•gravitational  instability  (see  above). 

Analyses  of  this  problem  in  Stone,  [1966,1970]  and 
Nakamura  [1988]  show  that  a  centrifugal  instability  does 
occur  near  Ri  =  1  for  both  zonally  symmetric  and 
asymmetric  fluctuations  and  that  another  ageostrophic 
instability  (with  shorter  zonal  wavelengths  than  the 
geostrophic  instability)  occurs  for  even  larger  values  of 
Ri,  though  its  onset  value  is  not  well  determined.  This 
latter  instability  is  shown  in  Figure  2  of  Stone  [1970], 
with  an  accompanying  remark  that  it  has  “growth  rates 
[Imcr]  which  decrease  as  Ri  increases,  and  may  in  fact 
still  be  unstable  for  Ri  >  2,  but  if  so  the  growth  rates 
were  too  small  to  to  be  foimd  by  our  numerical  method, 
because  of  the  near  singular  behavior  of  the  coefficients 
of  the  differential  equation  when  Ima  is  very  small.” 
Confirmation  is  presented  in  Figure  8  of  Nakamura 
[1988]  which  shows  that  Im(r  is  strongly  decreasing  as 
Ri  increases;  it  is  somewhat  unclear  exactly  how  large 
a  Ri  value  he  obtained  unstable  solutions  for,  but  his 
figure  suggests  it  is  at  least  as  large  as  Ri  =  2,  An 
interpretation  of  the  shortwave  instability  is  presented 
(p.  3261):  it  involves  a  resonance  between  a  boundary- 
trapped  shear  mode  and  an  inertia-gravity  mode,  with 
an  “inertia  critical  level”  that  limits  the  vertical  extent 
of  the  eigenmode.  Thus,  although  further  examination 
is  needed  of  the  behavior  of  Im<7  in  the  vicinity  of  the 
third  condition  for  loss  of  balance,  the  instabilities  for 
this  flow  appear  to  support  quite  well  the  h3q>othesis  we 
have  advanced. 

The  Rest  of  the  Quest 

We  have  presented  here  a  survey  of  the  known  insta¬ 
bilities  of  rotating,  stably  stratified  fluids.  Barotropic 
and  baroclinic  inflectional  instabilities  are  the  only 
types  that  occur  in  the  quasigeostrophic  limit,  and 
several  others  arise  in  the  neighborhood  of  the  limits 
of  balance  as  determined  jfrom  the  balance  equations. 
These  latter  include  gravitational,  symmetric  centrifu¬ 
gal,  and  elliptical,  as  well  as  presently  unnamed  variants 


of  barotropic  and  baroclinic  instabilities.  The  insta¬ 
bility  onset  is  generally  understood  to  be  sharp  with 
respect  to  the  N^  and  A  conditions  for  loss  of  bal¬ 
ance,  which  correspond  to  the  gravitational  and  cen¬ 
trifugal  instabilities.  We  have  shown  for  the  elliptical, 
Taylor-Couette,  and  barotropic  boundary-current  insta¬ 
bilities  that  the  onset  behaviors  are  smooth  in  the  tran¬ 
sition  across  the  A  -  |5|  condition,  though  still  quite 
steep,  with  exponential  or  steeper  dependences  for  the 
growth  rate,  a{A  —  151).  The  onset  behavior  near  the 
A  — 15|  condition  has  yet  to  be  as  well  determined  in  the 
ageostrophic  baroclinic  instability  problem,  but  previ¬ 
ous  studies  indicate  it  is  also  steep.  Kelvin-Helmholtz 
instability  does  not  fit  in  this  categorization  scheme  be¬ 
cause  its  onset  occurs  well  beyond  the  A  and  A  —  |5| 
conditions. 

Thus,  we  conclude  that  these  problems  give  confir¬ 
mation,  pro  tern,  of  the  hypothesis  that  the  limits  of 
balance  are  indicative  of  transition  from  the  larger-scale 
regime  of  inverse  energy  cascades  in  anisotropic  flows  to 
the  smaller-scale  regimes  of  forward  energy  cascade  to 
dissipation  of  more  nearly  isotropic  flows  and  intermit¬ 
tently  breaking  inertia-gravity  waves.  There  is  still  need 
for  refining  our  imderstanding  of  <x{Ri,Ro,Fr)  in  the 
neighborhood  of  the  conditions  for  loss  of  balance,  espe¬ 
cially  for  the  less  familiar  third  condition.  Now  that  we 
know  that  pure  barotropic  and  baroclinic  instabilities  fit 
the  hypothesis,  it  would  be  worthwhile  to  further  test 
it  with  more  general  shear  profiles  (e.g.,  the  “coastal” 
V{x,z)  profile  that  Barth  [1994]  has  shown  to  have  an 
ageostrophic  baroclinic  instability).  Beyond  these  lin¬ 
ear  instability  problems  lies  the  challenge  of  aptly  diag¬ 
nosing  flow  evolution  near  the  loss  of  balance  conditions 
in  general  nonlinear  initial-  and  boundary-value  prob¬ 
lems.  Nevertheless,  we  interpret  the  results  in  hand  as 
indicating  that  the  loss-of-balance  transitions  are  steep 
but  fuzzy,  consistent  with  the  view  that  the  balanced 
slow  manifold  is  itself  modestly  fuzzy  and  leaky  to  more 
eflficiently  cascading,  imbalanced  motions. 
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Abstract.  A  series  of  idealized  numerical  experiments  is  presented  to  study 
the  evolution  of  an  Agulhas  ring.  The  modeled  ring  is  a  circular  symmetric 
idealization  of  ring  Astrid,  as  measured  by  a  cruise  in  March  2000.  In 
particular,  the  hypothesis  is  tested  that  mixing  is  mainly  due  to  cross-frontal 
secondary  circulations.  Both  the  role  of  beta-decay  and  shear  instabilities  in 
setting-up  cross-frontal  mixing  is  studied.  The  largest  mixing  of  ring  water 
with  its  smroundings  occurs  in  the  first  months  after  shedding  when  the 
heat  loss  is  maximal.  Therefore,  it  is  tested  whether  buoyancy  forcing  is 
able  to  enhance  cross- frontal  circulations.  It  is  concluded  that  for  realistic 
initial  conditions  the  observed  decay  can  be  accoimted  for  by  adiabatic  decay 
associated  with  cross-frontal  circulations.  The  adiabatic  decay  appears  to  be 
a  nonlinear  function  of  the  ring  strength,  possibly  enhanced  by  the  instability 
of  the  ring.  The  relative  decay  for  weaker,  stable  rings  is  more  than  twice 
as  weak.  For  a  wide  parameter  range  all  modeled  rings  are  unstable  for  a 
mode-2  perturbation  and  split.  Observations  suggest  that  most  rings  do  not 
split  and  are  not  as  unstable  as  suggested  by  the  model.  What  causes  this 
mismatch  and  how  it  affects  the  decay  rate  is  still  imclear.  Realistic  cooling 
is  observed  to  slightly  ret^d  the  split  and  marginally  stabilize  the  ring.  The 
decay  rate  is  enhanced  with  typically  30%.  For  weaker  rings  the  impact  of 
cooling  becomes  less. 


Introduction 

In  Autumn  1999,  the  Netherlands  consorted  obser¬ 
vational  and  modeling  program  MARE  (Mixing  of  Ag¬ 
ulhas  rings  Experiment)  was  started  {Lutjeharms  et  a/., 
2000).  The  main  goal  is  to  estimate  the  proportion  of 
Agulhas  leakage  that  contributes  to  the  upper  branch  of 
the  thermohaJine  circulation  and  identify  the  dominant 
mixing  processes  that  determine  that  proportion.  The 
experiment  is  motivated  by  the  observation  that  the  sea 
surface  height  anomaly  of  Agulhas  rings  most  rapidly 
decays  just  after  shedding  (Schouten  et  al,  2000).  Fig¬ 
ure  1  shows  the  mean  sea  surface  height  (SSH)  anomaly 
of  11  Agulhas  rings  as  a  function  of  age.  During  the  first 
5  months  the  decay  is  fast:  5  cm  per  month.  After  10 
months  the  rings  decay  very  slowly:  0.5  cm  per  month. 
As  a  result,  in  the  early  phase  when  Agulhas  rings  cross 
the  Benguela  Current,  the  associated  density  anomaly 
for  a  large  part  is  mixed  away  into  the  surroundings. 
The  Benguela  Current  is  thought  to  be  the  main  agent 
for  the  North  Atlantic  Deep  Water  (NADW)  return  flow 


in  that  region  (De  Ruijter  et  al,  1999).  So,  the  mix¬ 
ing  process  (es)  associated  with  the  fast  decay  regime  in 
the  early  phase  of  Agulhas  rings  mainly  determines  the 
proportion  of  Agulhas  leakage  that  contributes  to  the 
NADW  return  flow. 

To  determine  the  mixing  associated  with  the  fast  de¬ 
cay  regime  from  measurements  only,  requires  an  ob¬ 
servational  database  that  contains  an  amount  of  de¬ 
tail  which  is  not  feasible  with  present  day  measurement 
techniques.  Therefore,  MARE  consists  of  a  hierarchy  of 
modeling  efforts  to  support  the  observational  program. 
Here,  we  report  a  series  of  model  simulations  initialized 
with  a  circular  symmetric  idealization  of  ring  Astrid,  as 
measured  by  the  first  MARE  cruise.  In  particular,  we 
test  the  hypothesis  that  mixing  is  mainly  due  to  cross- 
frontal  secondary  circulations.  That  is,  the  associated 
mixing  is  predominantly  adiabatic  (stirring).  The  stir¬ 
ring  itself  may  have  various  dynamical  origins  which 
can  be  adiabatic  (e.g.,  /3-decay,  baroclinic  instability), 
or  diabatic  (e.g.,  cooling,  double  diffusion). 
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Figure  1.  Mean  SSH  anomaly  (and  bars  of  1  standard 
deviation)  of  Agulhas  rings  plotted  against  their  age  from 
Schouten  et  al  (2000). 

Both  observations  and  modeling  efforts  have  shown 
that  secondary  circulations  may  arise  due  to  inter¬ 
nal  instabilities  (e.g.,  Pollard  and  Regier,  1992;  Spall, 
1995).  In  the  case  of  convective  chinmeys,  cooling  sets 
the  stage  for  baroclinic  instability.  Stirring  is  associ¬ 
ated  with  baroclinic  instability-driven  intrusions  (Legg 
and  McWilliams,  2000).  As  the  largest  mixing  of  ring 
Water  with  its  surroundings  coincides  with  maximum 
heat  loss,  we  test  whether  cooling  is  able  to  main¬ 
tain/enhance  cross-firontal  stirring  and  whether  this  is 
the  main  driving  agent  for  the  decay  of  Agulhas  rings. 

Experimental  setup 

The  simulations  we  analyze  have  been  performed 
with  MICOM  2.7,  which  is  an  extended  version  of  the 
ocean  general  circulation  model  described  by  Bleck  and 
Smith  (1990).  The  horizontal  resolution  is  0.05®  in  lon¬ 
gitude  and  latitude  and  there  are  12  layers  in  the  ver¬ 
tical.  Computations  were  carried  out  on  a  201  x  201 
periodic  domain  with  a  flat  bottom  at  4000  m  depth. 
The  model  was  initialized  with  a  circular  symmetric  ide¬ 
alization  of  ring  Astrid.  First,  for  12  selected  depth  in¬ 
tervals,  the  average  (Jq  outside  the  ring  was  calculated 
from  the  measurements.  These  ao  values  were  assigned 
to  the  isopycnic  layers.  Then,  the  <to  value  at  50  m 
depth  outside  the  ring  (the  bottom  of  layer  1)  was  es¬ 
timated,  and  the  depth  of  this  gq  level  was  assessed  as 
a  function  of  the  radial  distance  from  the  ring  center. 
The  curve,  so  obtained,  was  fitted  to  a  linear  profile 
(solid  body  rotation)  inside  the  ring,  and  an  exp(~r)^ 
profile  outside  the  ring.  The  bounding  edge  between  in¬ 
side  and  outside  was  defined  by  the  velocity  maximum 


of  the  ring. 

The  same  profile  was  used  for  all  layer  depths,  imply¬ 
ing  an  equivalent  barotropic  structure.  The  amplitude 
as  a  function  of  layer  number  was  obtained  by  eval¬ 
uating  the  (To  values  at  the  imdisturbed  layer  depths 
outside  the  ring,  and  then  calculating  the  depth  of  this 
(To  level  at  the  ring  center.  The  temperature  profile  was 
obtained  in  an  analogous  way  and  the  salinity  profile 
was  calculated  from  the  associated  temperature  and  (Tq 
values.  The  barotropic  pressure  gradient  was  assumed 
to  compensate  the  baroclinic  pressure  gradient  in  the 
deepest  layer.  The  initial  velocity  field  was  obtained  by 
demanding  cyclogeostrophic  balance. 

Due  to  the  periodic  boundary  conditions  there  is 
a  discontinuity  in  /  and  potential  vorticity  at  the 
north/south  boundary.  This  does  not  affect  the  ring 
structure,  which  is  initialized  in  the  middle  of  the  do¬ 
main,  but  it  affects  the  barotropic  waves  that  radiate 
away  from  the  ring.  As  the  barotropic  waves  re-enter 
the  domain  due  to  the  periodic  boundary  conditions, 
they  interact  with  the  ring  and  this  interaction  is  influ¬ 
enced  by  the  north/south  potential  vorticity  disconti¬ 
nuity.  The  impact  on  the  ring  of  this  interaction  with 
the  barotropic  wave  field  is  assumed  to  be  small  and 
not  to  aiffect  the  evolution  characteristics  of  the  ring. 

In  a  few  simulations  the  model  was  forced  with  uni¬ 
form  cooling  to  16 °C,  with  an  average  wind  field  of  7 
m/s.  This  value  was  estimated  from  the  wintertime 
SST  for  the  same  ring,  as  measured  during  the  second 
MARE  cruise.  Initially,  temperature  in  the  surface  layer 
was  18.43°C  outside  the  ring,  20.53 ®C  at  the  ring  cen¬ 
ter.  A  uniform  cooling  to  16,0  ®C  implies  entraining 
layer  2  and  part  of  layer  3  with  initial  values  of  12.82  ®C 
outside  the  ring  and  14.58®C  at  the  ring  center.  Layer 
3  is  between  100  and  200  m  depth  outside  the  ring, 
between  320  and  520  m  depth  at  the  ring  center. 

Results 

Stability  characteristics 

The  stability  of  the  modeled  ring  appeared  to  be 
highly  sensitive  to  details  of  the  initial  profile,  that  is, 
both  the  horizontal  shape  and  vertical  profile.  Hori¬ 
zontal  shapes  that  feature  vorticity  maxima  within  the 
ring  (e.g.,  an  exp(-r)^  profile)  generally  induce  quickly 
developing  instabilities.  Also,  the  vertical  profile  mat¬ 
ters.  Two  different  profiles  with  the  same  two-layer  inte¬ 
grated  properties  (<to  and  interface  displacement)  may 
show  completely  different  stability  behavior.  In  this 
case,  the  amplitude  of  the  barotropic  and  first  baro¬ 
clinic  mode  is  the  same,  but  the  amplitude  of  the  higher 
baroclinic  modes  differ.  Apparently,  the  amplitude  of 
the  higher  baroclinic  modes  have  a  strong  impact  on 
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Figure  2.  Thickness  of  the  upper  layer  at  days  50,  60,  70  Figure  3.  Thickness  of  the  upper  layer  at  days  110  130 
and  80  with  alpha=l.  Contour  interval  is  20  m.  150  and  170  with  alpha=0.33.  Contour  interval  is  5  m.  ’ 


the  stability  characteristics  of  the  ring,  and  as  a  result, 
the  two-layer  stability  results  of  Dewar  et  al.  (1999)  do 
not  simply  extrapolate  to  a  multilayer  setting. 

All  runs  we  discuss  here  use  nearly  the  same  parame¬ 
ter  setting,  which  is  derived  from  ring  Astrid.  The  only 
parameter  we  vary  is  the  overall  strength.  In  all  runs 
both  the  barotropic  pressure  gradient  and  interface  dis¬ 
placements  are  multiplied  \wth  one  and  the  same  factor 
a.  With  a  =  1  the  ring  mimics  ring  Astrid.  This  ring  is 
unstable  for  a  mode-2  perturbation  and  the  ring  splits 
after  80  days.  Figure  2  shows  the  evolution  of  upper 
layer  thickness  between  day  50  and  80.  At  day  50  the 
ring  has  become  ellipsoidal.  At  day  60  it  is  even  more 
elongated.  At  day  70  a  neck  is  forming  in  the  middle, 
and  at  day  80  the  ring  is  split  into  a  slightly  weaker  and 
a  stronger  half. 

The  ring  remains  unstable  while  decreasing  a  to  0.5. 
When  we  decrease  a  further  to  0.44  the  ring  becomes 
stable,  that  is,  the  ring  sheds  off  satellites  that  become 
increasingly  smaller  and  weaker  relative  to  the  parent 
ring  with  decreasing  a.  Figure  3  shows  the  evolution 
of  upper  layer  thickness  between  day  110  and  170  for 
a  =  0.33.  The  ring  becomes  much  less  ellipsoidal.  It  de¬ 
velops  streamers  from  which  at  intervals  small  satellites 
are  cut  off. 

The  strong  tendency  of  ring  Astrid  to  split  is  remark¬ 
able.  Schouten  et  al.  (2000)  tracked  20  Agulhas  rings 
using  TOPEX/Poseidon  satellite  altimetry.  Of  those 


20,  6  split,  and  14  remained  stable.  As  the  strength 
of  Astrid  appears  not  to  be  a  critical  parameter,  ei¬ 
ther  the  density  stratification,  or,  the  vertical  profile  of 
the  rings  should  enhance  the  stability.  In  our  calcu¬ 
lations  a  change  in  stratification  did  not  have  a  very 
strong  impact,  while  a  change  in  vertical  profile  in  gen¬ 
eral  made  the  ring  more  imstable.  An  alternative  expla¬ 
nation  could  be  that  for  the  observed  rings  (nonlinear) 
interaction  with  the  background  flow,  or,  with  other 
rings  and  cyclones  could  prevent  splitting.  Also,  the 
bathymetry  could  stabilize  the  rings. 

Decay 

When  calculating  the  SSH  anomaly  as  a  function  of 
time  for  the  unforced  runs  with  a  =  1,  it  appears 
the  decay  rate  is  consistent  with  the  fast  decay  regime 
shown  in  Fig.  1.  The  cross-frontal  mass  exchange  is  as¬ 
sociated  with  a  single-cell  secondary  circulation  that  im- 
phes  upwelling  at  the  back  of  the  ring  and  downwelling 
at  the  face  of  the  ring  (Fig.  4a).  Here,  back  and  face 
are  defined  along  the  )0-induced  movement.  On  the  ba¬ 
sis  of  potential  vorticity  considerations  we  would  expect 
/-plane  subduction  to  occur  at  the  edge  of  the  ring  and 
upwelling  in  the  middle  when  some  sort  of  confluence 
is  present  near  the  ring  edge  (e.g.,  Pollard  and  Regier, 
1992;  Spall,  1995).  This  confluence  can  be  provided  for 
by  the  developing  baroclinic  instability.  Apparently,  the 
instability  process  develops  slowly  for  Agulhas  rings,  so 
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Figure  4.  Meridional  overturning  at  day  11  with  alpha— 1. 
Units  are  in  milliSverdrups.  a)  left  panel  unforced  run;  b) 
right  panel  run  with  cooling. 


that  the  associated  confluence  is  too  weak  to  overcome 
the  /?>induced  movement.  The  resulting  single-cell  over¬ 
turning  slightly  extends  across  the  ring  boundaries  and 
is  associated  with  adiabatic  stirring  of  the  ring’s  density 
anomaly  across  its  boundaries.  When  the  ring  is  stable 
the  associated  adiabatic  decay  is  (relatively)  more  than 
twice  as  slow,  too  weak  to  account  for  the  observed  fast 
decay  regime.  It  is  disturbing  that  the  observed  rings 
seem  less  unstable  but  still  show  a  decay  rate  that  is 
only  found  for  modeled  rings  that  are  unstable. 

When  we  allow  for  cooling,  convection  initially  mod¬ 
ifies  the  /3-induced  overturning  cell.  Figure  4b  shows 
the  meridional  overturning  at  the  same  stage  (day  50) 
as  Fig.  4a,  but  now  with  cooling  turned  on.  The 
convection-induced  modification  on  the  overturning  is 
small.  The  only  effect  is  more  small-scale  structure. 
The  decrease  of  SSH  as  a  function  of  time  is  enhanced 
by  30%.  This  is  due  to  the  direct  effect  of  heat  re¬ 
lease  through  the  surface.  For  stable  rings  the  impact 
of  cooling  is  smaller;  too  small  to  recover  the  observed 
fast  decay  regime.  The  impact  of  convection  on  the 
baroclinic  instability  is  weak.  This  is  contrary  to  the 
case  of  convective  chimneys.  There,  the  stratification  is 
already  weak,  and  cooling  can  significantly  change  the 
initial  stratification  and  reduce  the  Ross  by  radius.  This 
may  have  a  severe  impact  on  the  stability  characteris¬ 
tics  of  the  convective  chimneys  {Legg  and  McWilliams, 
2000).  In  case  of  Agulhas  rings,  the  stratification  is 
much  stronger  and  the  depth  penetration  of  the  effect 
of  cooling  is  much  less.  As  a  result  the  Rossby  radius 
is  hardly  reduced  by  convection,  and  the  dominant  ef¬ 
fect  is  the  reduction  of  vertical  shear  in  the  upper  layers. 
Subsequently,  cooling  slightly  stabilizes  the  ring  and  the 
instability  process  develops  marginally  slower.  Figure  5 
shows  the  evolution  of  upper  layer  thickness  for  a  =  1 
with  cooling  turned  on  at  day  zero.  Comparing  Figs.  5 
and  2  we  observe  only  small  differences;  the  splitting 
process  occurs  somewhat  later  in  the  case  when  cooling 


Figure  5.  Thickness  of  the  upper  layer  at  days  50,  60,  70 
and  80  with  alpha=l.,  and  cooling  turned  on. 

has  been  turned  on. 

Discussion  and  Conclusions 

Both  adiabatic  (/?-) decay  and  baroclinic  instability 
seem  to  account  for  the  observed  initial  fast  decay  of 
Agulhas  rings.  The  overturning  associated  with  the  /?- 
induced  movement  implies  cross-frontal  mass  exchange. 
Baroclinic  instability  mainly  occurs  for  a  wavenumber- 2 
disturbance  which  leads  to  ring  splitting.  The  splitting 
itself  hardly  affects  the  SSH  and  density  anomalies  of 
the  associated  rings.  Weaker,  stable  rings  decay  rela¬ 
tively  more  than  twice  as  slow.  Either  the  decay  rate  is 
a  nonlinear  function  of  the  ring  strength,  or,  baroclinic 
instability  speeds  up  the  overall  decay.  In  that  case, 
adiabatic  decay  does  not  explain  the  fast  decay  regime 
of  the  observed  rings,  as  these  seem  more  stable  than 
the  modeled  rings.  From  the  observed  rings  30%  split 
(Schouten  et  al.,  2000),  while  in  the  model  a  reduction 
in  ring  strength  by  a  factor  of  2  relative  to  observed  still 
leads  to  a  split. 

The  impact  of  cooling  on  the  decay  of  Agulhas  rings 
is  rather  modest.  Both  the  strong  stratification  and  rel¬ 
atively  weak  air/sea  temperature  contrasts  seem  to  be 
responsible  for  this.  Cooling  on  the  eddy  scale  is  strong; 
115  W  m“^  (Drijfhout  and  Walsteijn  1998).  In  a 
Gulf  Stream-like  environment,  cooling  of  rings  signifi¬ 
cantly  affects  the  regional  heat  balance  and  the  decay 
of  warm-core  Gulf  Stream  rings.  In  the  case  of  Agul- 
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Figure  6.  Zonal  section  of  salinity  through  ring  Astrid, 
taken  at  the  second  MARE  cruise. 

has  rings,  however,  in  general  the  depth  penetration  of 
cooling  is  too  small  to  have  such  large  impacts  on  the 
density  and  SSH  anomaly  of  the  rings.  The  decay  rate  is 
typically  enhanced  by  30%;  for  weaker  rings  the  impact 
of  cooling  becomes  less. 

In  our  opinion,  the  most  likely  candidate  to  affect 
the  decay  of  Agulhas  rings  we  have  overlooked  so  far, 
seems  to  be  interleaving  by  double-diffusive  driven  ther¬ 
mohaline  intrusions.  Both  in  case  of  meddies  and  warm 
core  Gulf  Stream  rings,  double-diffusive  driven  inter¬ 
leaving  has  been  identified  as  a  main  candidate  in  the 
decay  of  the  mesoscale  feature  (e.g.,  Tang  et  ai,  1985, 
Ruddick^  1992).  A  zonal  section  through  the  edge  of 
ring  Astrid  obtained  form  MARE- 2  shows  thermoha¬ 
line  intrusions  to  be  operative  between  100  and  150  nm 
from  the  coast  and  at  200  m  depth  (Fig.  6).  A  further 
analysis  of  the  data  is  needed  to  identify  whether  these 
thermohaline  intrusions  have  a  double-diffusive  origin. 
Also,  the  present  numerical  model  has  to  be  severely 
extended  to  be  able  to  account  for  the  development  of 
double-diffusive-driven  thermohaline  intrusions.  Such  a 
study  is  left  for  the  future. 
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Vertical  shear  plus  horizontal  stretching  as  a  route 
to  mixing 

Peter  H.  Haynes 

Department  of  Applied  Mathematics  and  Theoretical  Physics,  University  of  Cambridge,  UK 

Abstract.  The  combined  effect  of  vertical  shear  and  horizontal  stretching 
leads  to  thin,  sloping  structures  in  tracer  fields,  whose  vertical  length  scale 
is  much  smaller  than  their  horizontal  length  scale.  These  structures  are 
then  vulnerable  to  vertical  mixing  processes.  This  effect  needs  to  be  taken 
into  account  when  interpreting  the  horizontal  structure  of  oceanic  (and 
atmospheric)  tracers  and  may  explain  recent  oceanographic  observations. 


1.  Introduction 

In  both  atmosphere  and  ocean  it  is  useful  to  divide 
velocity  fields  into  isentropic  or  isopycnal  parts,  which 
move  fluid  parcels  along  potential  temperature  or  po¬ 
tential  density  surfaces,  and  diabatic  or  diapycnal  parts, 
which  move  fluid  parcels  across  such  surfaces.  This  is 
because  diabatic  or  diapycnal  motion  can  be  accom¬ 
plished  only  by  non-conservative  physical  processes  such 
as  radiative  heating  (in  the  atmosphere)  and  molecular 
diflFusion  of  temperature  enhanced  by  three-dimensional 
turbulence  (in  atmosphere  and  ocean).  In  large  parts  of 
the  atmosphere  and  ocean,  such  processes  are  weak  in 
some  average  sense,  so  that  diabatic  or  diapycnal  veloc¬ 
ities  must  be  much  smaller  than  isentropic  or  isopycnal 
velocities. 

The  process  of  stirring  may  therefore  be  regarded  as 
layerwise  two-dimensional,  in  the  sense  that  differential 
advection  by  the  quasi-horizontal  isentropic/isopycnal 
velocity  field  acts  independently  on  each  isentropic  or 
isopycnal  surface  to  distort  tracer  fields  into  complex 
geometrical  configurations.  If  atmospheric  and  oceanic 
flows  were  purely  two-dimensional,  similar  to  many  nu¬ 
merical  simulations  or  laboratory  experiments,  then 
stirring  would  simply  strengthen  horizontal  spatial  gra¬ 
dients  of  tracer  fields  until  horizontal  diffusion  became 
competitive  with  advection  and  mixing  occured.  But 
in  layerwise  two-dimensional  atmospheric  and  oceanic 
flows,  horizontal  advection  that  varies  in  the  vertical 
may  also  act  to  increase  vertical  gradients,  thereby  en¬ 
hancing  the  effects  of  vertical  diffusion,  or  more  compli¬ 
cated  vertical  mixing  processes.  The  fact  that  vertical 
scales  axe  almost  always  observed  to  be  considerably 
smaller  than  horizontal  scales  suggests  that  mixing  dif¬ 
fusion  rather  than  horizontal  mixing  is  the  dominant 
process. 

Most  previous  theoretical  investigations  of  stirring 


and  mixing  of  tracers  have  focussed  on  models  of  quasi¬ 
isotropic  flows,  either  in  two  or  three  dimensions.  The 
considerations  above  suggest  that  the  most  relevant 
model  for  the  atmosphere  and  the  ocean  is  an  anisotropic 
model  in  which  the  vertical  velocity  is  zero,  but  the 
horizontal  velocities  depend  on  the  vertical  coordinate. 
Haynes  and  Anglade  (1997,  hereafter  HA97)  used  such 
an  amsotropic  model  to  study  the  enhancement  of  ver¬ 
tical  gradients  by  horizontal  stirring  plus  vertical  shear. 
This  is  a  relatively  simple  extension  to  previous  work, 
but  essential  if  theoretical  results  are  to  be  relevant  to 
real  flows.  The  HA97  work  was  motivated  primarily  by 
the  stratosphere.  This  article  reviews  and  expands  on 
some  of  the  arguments  in  that  paper,  focussing  in  par¬ 
ticular  on  the  implications  for  stirring  and  mixing  in 
the  ocean. 

The  model  to  be  presented  is  based  on  the  paradigm 
of  ‘chaotic  advection’,  that  flows  that  are  smoothly 
varying  in  space  and  time  may  lead  to  patterns  of  ad- 
vected  tracer  that  are  spatially  highly  complex.  One 
of  the  requirements  on  a  chaotic  advection  flow  is  that 
there  is  a  useful  division  of  scales  between  the  smallest 
active  scale  in  the  advecting  velocity  field  and  the  dif¬ 
fusive  scale  (or  more  generally  scale  at  which  genuine 
processes  act).  The  case  of  three-dimensional 
turbulence,  for  example,  does  not  fall  into  this  cate¬ 
gory,  since  the  velocity  field  has  complex  spatial  struc¬ 
ture  at  scales  down  to  the  diffusive  scale  and  it  may 
be  argued  that  the  stirring  of  tracer  features  at  any 
scale  larger  than  the  diffusive  scale  is  dominated  by 
the  flow  at  the  same  scale  (rather  than  at  the  large 
scale).  In  fact  the  chaotic  advection  paradigm  is  rele¬ 
vant  in  three-dimensionally  turbident  flows  for  tracers 
with  diffusivity  much  less  than  the  momentum  diffusiv- 
ity  (large  Schmidt  number).  The  tracer  field  may  then 
have  non-trivial  structure  on  a  spatial  scale  smaller  than 
the  Kolmogorov  scale,  with  the  flow  at  the  Kolmogorov 
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scale  providing  the  ‘large-scale’  stirring.  This  is  often 
called  the  Batchelor  regime  of  turbulence  and  has  been 
studied  as  an  important  special  case  in  recent  theoret¬ 
ical  work  on  tracer  fields  in  turbulent  fiows  (e.g.  see 
Shraiman  and  Siggia,  2000  and  references  therein). 

In  the  stratosphere  it  seems  to  be  useful  to  consider 
tracer  fields  to  be  determined  by  the  stirring  effect  of  the 
large-scale  flow  (varying  on  spatial  scales  of  hundreds 
to  thousands  of  kilometers)  and  by  the  mixing  effect  of 
localised  patches  of  three-dimensional  turbulence,  with 
vertical  scales  of  a  hundred  meters  or  so  and  horizontal 
scales  of  a  few  kilometers.  There  is  no  strong  evidence 
of  active  stirring  by  eddies  on  scales  from  a  few  hundred 
down  to  tens  of  kilometers,  and  in  this  range  of  scales 
the  chaotic  advection  paradigm  is  therefore  relevant. 

Similarly  in  the  ocean  one  might  argue  that  mesoscale 
eddies  with  scales  of  tens  of  kilometers  are  the  dominant 
part  of  the  flow  in  stirring  the  tracer  field  at  scales  from 
a  few  tens  of  kilometers  down  to  the  scale  of  small-scale 
mixing  events. 

The  structure  of  the  paper  is  as  follows.  Section  2  de¬ 
scribes  a  simple  model  problem  of  steady  flow  including 
both  horizontal  stretching  and  vertical  shear,  focussing 
on  the  spreading  of  a  tracer  from  a  point  release  in  such 
a  flow.  Section  3  discusses  some  aspects  of  tracer  evolu¬ 
tion  in  more  general  random  flows.  Section  4  considers 
application  of  these  ideas  to  the  observations  of  Ledwell 
et  al.  (1993)  (hereafter  L93).  Section  5  gives  a  brief 
discussion  of  the  possible  competing  role  of  mixing  by 
double-diffusive  intrusions-  Section  6  describes  some 
possible  future  lines  of  research. 

2.  Steady  flow  model 

The  simplification  allowed  by  the  chaotic  advection 
paradigm  is  that  there  is  a  finite  scale  below  which  the 
flow  may  be  considered  as  a  linear  function  of  the  space 
coordinates,  i.e.  the  velocity  field  may  be  expanded 
as  a  (multidimensional)  Taylor  series  about  some  ref¬ 
erence  location  and  only  the  linear  term  in  the  series 
retained.  The  effects  of  small-scale  mixing  events  may 
represented  by  a  diffusion  term  (but  more  sophisticated 
representations  are  also  possible). 

We  follow  HA97  in  considering  first  a  steady  linear 
flow  that  has  the  two  ingredients  of  horizontal  strain,  F, 
plus  vertical  shear,  A,  In  much  of  the  atmosphere  and 
ocean  it  is  relevant  to  consider  the  parameter  regime 
A  ^  r,  i.e.  vertical  shear  much  larger  than  horizontal 
strain.  This  may  be  argued  from  the  observed  fact  that 
horizontal  length  scales  tend  to  be  larger  than  verti¬ 
cal  scales  or  by  appealing  to  theoretical  arguments  that 
the  ratio  of  horizontal  to  vertical  scales  is  of  order  f/N 
(Prandtl’s  ratio),  where  /  is  the  Coriolis  parameter  and 


N  the  buoyancy  frequency.  f/N  is  0(100)  in  the  at¬ 
mosphere  and  in  the  oceanic  thermocline  and  perhaps 
reduces  to  0(10)  in  the  deep  ocean. 

The  flow  is  taken  to  have  components  (Fx,  —Ty  -h 
Az,  0)  in  the  x,  y  and  z  directions  respectively,  where 
(x,y,  z)  are  Cartesian  coordinates,  with  z  vertical.  The 
steadiness  assumption  implies  that  the  horizontal  strain 
F  and  the  vertical  shear  A  are  constants.  The  equation 
for  the  evolution  in  this  flow  of  a  tracer  with  concentra¬ 
tion  x(a^,  y,  z,  t)  is  therefore 


— -f-Fx 
dt^  dx 


dy  '9x2  dy^^dz^' 


(1) 


where  k  is  the  diffusivity,  assumed  constant. 

HA97  considered  sinusoidal  solutions  of  this  equa¬ 
tion.  Another  useful  solution  of  this  equation,  and  in¬ 
deed  of  the  equation  for  tracer  concentration  in  any  flow 
where  velocity  components  are  linear  functions  of  space 
coordinates  (including  time-dependent  flows),  is  an  el¬ 
lipsoidal  Gaussian  solution,  i.e.  an  exponential  func¬ 
tion  of  a  negative  definite  quadratic  function  of  x,  y 
and  z.  This  is  conveniently  obtained  by  considering  the 
second-order  spatial  moments  of  the  solution,  such  as 
mxi  =  f  f  xdV,  nrixy  etc.  The  six  moments 

are  sufficient  to  determine  the  quadratic  function  and 
the  constraint  that  the  total  amount  of  tracer  remains 
constant  then  determines  the  coefficient  of  the  exponen¬ 
tial  function.  In  the  above  flow,  the  equations  for  the 
moments  may  be  shown  to  be 


^^TTLxx  —  2F?71xx  "f”  ^ 

(2) 

^  -A 

—  I\.7Tixz 

(3) 

d 

^^rrixx  —  FttZxx 

(4) 

rriyy  — "  — 2F77iyy  “j”  2A7T2>yx  "1“  2AC 

(5) 

d 

d 

(6) 

(7) 

These  equations  may  be  solved  straightforwardly. 
For  brevity  we  consider  the  case  where  all  moments 
tend  to  zero  as  t  0“^,  corresponding  to  a  point  re¬ 
lease.  Then  the  solutions  are 


nixx  — 


F^ 


(8) 


(9) 
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myy  =  ^  {2Tt  +  4e-^‘  -  -  3}+^  {l  - 

(10) 

my,  =  2'^{Tt-l  +  e-^^}  (11) 

rrizz  =  2Kt  (12) 

First  consider  the  case  where  A  =  0,  i.e.  the  case  of 
purely  two-dimensional  flow.  Thenma;x  = 
and  TTiyy  =  k{1  —  e^^^^)/r.  For  small  times,  t  F”^, 
diffusion  dominates  and  both  and  myy  increase  lin¬ 
early  with  time.  When  t  ~  F""^  advection  by  the  hor¬ 
izontal  strain  flow  becomes  competitive  with  diffusion. 
The  tracer  elongates  in  the  direction  of  the  stretch¬ 
ing  axis,  so  that  for  t  »  F“■^  mxx  ~  whilst 

in  the  direction  of  the  compression  axis  a  steady-state 
balance  between  strain  and  diffusion  is  achieved,  with 
myy  ~  k/T,  At  all  times  the  only  vertical  transport  is 
tluough  diffusion,  so  mzz  increases  linearly  with  t  at  all 
times. 

What  is  the  effect  of  adding  vertical  shear,  i.e.  tak¬ 
ing  0  (with  A  ^  F)?  The  elongation  of  the  tracer 
along  the  stretching  axis  of  the  horizontal  strain  axis  is 
imchanged.  However,  the  spreading  of  the  tracer  in  the 
direction  of  the  compression  axis  is  rather  different.  Ex¬ 
amination  of  the  expression  for  myy  above  shows  that 
there  are  three  regimes.  For  t  <  A~^  (regime  I)  there  is 
purely  diffusive  spreading  as  above.  For  A“^  <  t<  F"”^ 
(regime  II)  the  diffusive  spreading  of  the  tracer  in  the 
vertical  allows  the  vertical  shear  to  augment  the  spread¬ 
ing  in  the  horizontal,  so  that  myy  kAH^.  Finally 
when  t  ~  F”^  horizontal  advection  begins  to  inhibit 
the  horizontal  spreading,  so  that  for  t>T~^  (regime 
III),  myy  ^  KtA^/r^.  These  three  regimes  are  depicted 
graphically  in  Figure  1.  Note  that  myy  continues  to  in¬ 
crease  with  time,  in  contrast  to  the  case  with  A  =  0. 
In  regime  III  myy  ~  Am^^^s/F  A^m^^/F^  suggest¬ 
ing  that  the  y  and  z  scales  of  the  tracer  patch  are  in 
the  ratio  A/F.  Indeed  what  happens  is  that  the  tracer 
patch  forms  a  sheet  that  slopes  at  angle  F/A  (from 
the  horizontal)  in  the  (y,  z)  plane.  The  extent  of  the 
sheet  in  the  vertical  direction  is  (/ct)^/^,  therefore  the 
extent  of  the  sheet  in  the  horizontal  (y)  direction  is 
A(/ct)^/^/F.  This  explains  the  dominant  behaviour  of 
myy,  myz  aud  In  order  to  deduce  the  spread  of 
the  tracer  on  a  given  horizontal  surface  it  is  useful  to 
consider  myy  -  2Amyz/T  +  mzz^^/'^^,  i.e.  the  moment 
of  (y  —  Az/F)^.  It  is  straightforward  to  show  that 

myy  -  2Amyz/T  -h  m^zA^/F^  = 

(13) 


(Kt)>«a  (I) 


Figure  1.  The  spread  of  tracer  in  the  (y,z)  in  each  of 
regimes  I  (t;$  A“^),  regime  II  (A“^<  t<  F"^)  and  regime 

in  (F“^<t). 


Thus  the  spread  of  the  tracer  on  a  given  horizon¬ 
tal  surface  is  («:/F)^/^A/F.  The  effect  of  the  vertical 
shear  is  therefore  to  increase  the  equilibrium  width  of 
the  tracer  patch  by  a  factor  A/F. 

In  summary,  the  effect  of  the  vertical  shear  is  to  lead 
to  sloping  structures  in  the  tracer  field  with  aspect  ratio 
(ratio  of  horizontal  length  scale  to  vertical  length  scale) 
a  =  A/F.  The  equlibrium  width  of  these  structures 
is  larger  by  a  factor  a  than  would  be  the  case  without 
vertical  shear.  One  might  say  that  it  is  as  if  a.  horizontal 
diffusivity  ko^  were  acting. 

3.  Time-dependent  flow  models 

In  a  fiow  that  varies  in  space  and  time,  the  velocity 
gradient  encountered  by  a  fluid  parcel  changes  with  time 
as  the  particle  moves  through  the  flow.  The  evolution  of 
small-scale  tracer  features  is  therefore  governed  by  the 
equation  for  tracer  evolution  in  a  time-dependent  linear 
flow.  In  the  study  of  turbulence  there  is  a  long  tradition 
of  considering  the  effect  of  a  linear  flow  that  varies  ran¬ 
domly  in  time  (using  so-called  ‘random-straining’  mod¬ 
els).  This  approach  has  been  used  most  recently  by 
various  authors  to  give  a  rather  complete  description  of 
the  statistics  of  tracer  fields  in  Batchelor-regime  turbu¬ 
lence.  [See  Balkovsky  and  Fouxon,  1999  and  Falkovich 
et  al,  2001  for  further  details  of  this  work.] 

The  random-straining  model  most  relevant  to  the  at¬ 
mosphere  and  ocean  is  non-standard  in  that  the  statis¬ 
tics  of  the  velocity  gradient  tensor  is  not  isotropic  (since 
the  vertical  velocity  is  zero,  but  the  vertical  gradient  of 
the  horizontal  velocity  is  not).  HA97  considered  such 
random-straining  models  and  showed  that  the  impor¬ 
tant  predictions  of  the  steady  flow  model  presented  in 
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Section  2  carried  over,  in  the  sense  that  the  tracer  field 
was  predicted  to  form  sloping  sheets,  with  an  aspect  ra¬ 
tio  a  that  depended  on  the  statistics  of  the  horizontal 
strain  and  vertical  shear  fields. 

We  review  some  of  the  HA97  results  below,  empha¬ 
sising  the  dependence  of  a  on  the  different  flow  parame¬ 
ters.  One  particular  goal  is  to  identify  conditions  under 
which  a  is  not  simply  equal  to  the  aspect  ratio  A/F  of 
the  flow  itself. 

It  is  convenient  to  consider  the  normalised  gradient  of 
the  tracer  field,  defined  by  k(t)  =  evaluated  at 

the  position  x  =  X{t)  following  a  fluid  parcel,  k  might 
be  considered  as  a  local  wavenumber  of  the  tracer  field 
in  the  case  where  the  tracer  field  varies  rapidly  in  space 
compared  to  the  velocity  field.  It  may  be  shown  from 
the  gradient  of  the  tracer  advection  equation  that 

f  =  (14) 

where  the  scalar  product  on  the  right-hand  side  applies 
to  the  second  index  of  the  tensor  Vu.  The  evolution  of  k 
therefore  depends  on  the  time  series  of  Vu  (encountered 
following  the  point  X(t)).  In  a  random-straining  model 
the  eflFect  of  turbulent  flow  on  gradients  of  tracer  (or 
line  elements)  is  considered  as  a  kinematical  problem, 
in  which  the  tensor  Vu  is  taken  to  be  given  by  a  random 
time  series,  without  addressing  questions  of  the  flow 
dynamics. 

For  the  layerwise  twodimensional  flows  of  interest 
here,  the  matrix  W  of  components  of  the  velocity  gra¬ 
dient  tensor,  with  Wij  ^  dui/dxj,  has  Wzi  =  1^32  = 
=  0.  (The  index  3  is  taken  to  correspond  to 
the  2  coordinate.)  It  is  useful  to  write  k  =  (k^'^),!/!), 
with  k^^^  the  horizontal  wavenumber  and  m  the  verti¬ 
cal  wavenumber,  and  define  to  be  the  2nd  rank 
tensor  with  components  Wu,  W21,  W12  and  W22,  and  A 
to  be  the  vector  with  components  ^'23)-  It  then 
follows  that 

^  (15) 

and 

=  (16) 

In  what  follows  the  summation  convention  is  used  for 
repeated  suffices  and  the  suffices  run  through  the  val¬ 
ues  {1,2}.  Equation  (15)  is  just  that  for  the  tracer 
wavenumber  in  two-dimensional  (horizontal)  flow.  Note 
that  it  may  be  solved  independently  of  (16).  Thus  vari¬ 
ation  in  the  z-direction  makes  no  difference  to  the  evolu¬ 
tion  of  the  horizontal  wavenumber  vector.  However  (16) 
shows  that  the  vertical  wavenumber  evolves  through  the 
vertical  shear  (in  the  horizontal  flow)  acting  on  the  hor¬ 
izontal  wavenumber  vector. 


To  formulate  a  suitable  random-straining  model  we 
assume  that  each  of  the  six  non-zero  components  of  the 
matrix  W  is  a  realisation  of  a  random  function  of  time, 
with  average  (over  all  realisations)  zero.  We  also  assume 
that  the  wavenumber  at  t  =  0  is  randomly  chosen  and 
is  statistically  independent  of  the  Wij.  Equation  (16) 
may  be  integrated  to  give 

m{t)  =  m{0)  +  Aj{t')kf\t')dt'  (17) 

and  then  squaring,  and  taking  the  ensemble  average,  it 
follows  that 

{m{tf)  =  (m(0)2)+ 

/‘dt'/‘df"(A,(tOfcf(f')Afe(t")A;r^(<")),  (18) 

Jo  Jo 

where  (.)  denotes  the  ensemble  average,  over  all  reali¬ 
sations  of  the  flow  field  and  all  realisations  of  the  initial 
wavenumber  vector. 

Useful  quantitative  estimates  are  possible  if  a  num¬ 
ber  of  further  assumptions  are  made  concerning  the 
statistics  of  the  velocity  gradient  tensor.  The  first  is 
that  the  components  of  A  are  independent  of  all  com¬ 
ponents  of  i.e.  that  the  vertical  shear  is  statis¬ 

tically  independent  of  the  horizontal  deformation.  It 
follows  that  A  is  also  independent  of  k^^^  so  that  av¬ 
erages  of  terms  involving  A  and  terms  involving  k^^^ 
may  be  taken  separately  in  (18),  The  second  impor¬ 
tant  assumption  is  stationarity,  from  which  it  follows 
||(Aj(t')AA:(t"))||  ^  A^p(aA|t'  ~  t"|),  for  some  function 
g,  where  cr^  is  an  inverse  correlation  time  for  the  vertical 
shear  and  A^  =  (A^). 

In  order  to  estimate  the  integral  appearing  in  (18) 
it  is  also  necessary  to  have  information  on  the  evolu¬ 
tion  of  the  horizontal  wavenumber.  This  follows  ex¬ 
actly  as  in  the  random  straining  models  of  isotropic 
two-dimensional  flow  considered  by  Kraichnan  (1974) 
and  others.  The  basic  predictions  of  these  theories  are 
that  the  horizontal  wavenumber  increases  exponentially 
in  time,  at  a  rate,  S  say,  governed  by  the  statistics 
of  the  horizontal  strain  field  and  depending  in  partic¬ 
ular  on  the  root  mean  square  rate  of  strain,  F  say, 
and  on  the  inverse  correlation  time  ah  for  the  hori¬ 
zontal  strain  field.  Detailed  analysis  of  explicit  models 
{Kraichnan^  1974;  Chertkov  et  al  ,  1995;  HA97)  sug¬ 
gests  that  5  rmin(r/o-h,l). 

Agadn,  making  suitable  stationarity  assumptions,  this 
suggests  that  IKA:f  ~  - 

t"|),  where  =  (k^'^)(0)^).  The  inverse  time  scale  /jl 
and  the  precise  form  of  the  function  A(.),  depend  on 
the  statistics  of  the  horizontal  strain  field,  but  explicit 
calculation  in  various  models  suggests  that  ~  5. 
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Substituting  these  estimates  into  (18)  it  follows  that, 
for  large  times,  when  the  second  term  on  the  right-hand 
side  dominates  the  first. 


(m(t)2)  ~ 


(19) 


Note  that  the  dominant  contribution  to  the  first  inte¬ 
gral,  over  performed  in  (18)  comes  from  a  neighbour¬ 
hood  of  t"  =  t  of  size  min{S“\cr;^^}  and  that  to  the 
second  integral,  over  comes  from  a  neighbourhood  of 
f  =  t  of  size  5^^. 

The  ratio  a  of  horizontal  length  scale  to  vertical 
length  scale,  or,  equivalently,  the  ratio  of  vertical  wavenum¬ 
ber  to  horizontal  wavenumber  may  therefore  be  esti¬ 
mated  as 


a 


{mm 


...  5 
^min{l,— } 


cta 


{kwm 

A^  (T^  r  r 

P2  “ax(l,  -|)  min{l,  —  min(l,  — )}.(20) 

^  A  afi 


with  the  last  estimate  following  from  the  estimate  for 
S. 


This  semi-quantitative  analysis  suggests  that  the  ver¬ 
tical  wavenumber  increases  exponentially  at  the  same 
rate  as  the  horizontal  wavenumber,  and  that  the  ratio  a 
is  equal  to  the  ratio  of  vertical  shear  to  horizontal  strain, 
A/r ,  multiplied  by  a  number  that  depends  on  o/i/T  and 
cta/T .  These  predictions  agree  with  explicit  calculations 
based  on  suitably  formulated  random-straining  models. 
(See  HA97  for  more  details.) 

Figure  2  summarises  the  variation  of  the  aspect  ra¬ 
tio  a  with  the  parameters  cr^  and  <7a.  Note  that  a  is 
anomalously  large  (i.e.  greater  than  A/F)  when  the  hor¬ 
izontal  strain  varies  on  a  timescale  that  is  shorter  than 
both  the  inverse  horizontal  strain  rate  and  the  timescale 
of  variation  of  the  vertical  shear.  On  the  other  hand, 
a  is  anomalously  small  (i.e.  less  than  A/F),  when  the 
vertical  shear  varies  on  a  timescale  that  is  shorter 
both  the  inverse  horizontal  strain  rate  and  the  timescale 
of  variation  of  the  horizontal  strain.  As  might  be  ex¬ 
pected,  in  the  limit  where  oa  <  F  and  cta  F  the 
estimate  for  a  agrees  with  that  obtained  for  the  steady 
flow  considered  in  the  previous  section. 

Note  that  in  this  section  we  have  not  considered  the 
effects  of  diflFusion  explicitly.  Nonetheless,  it  is  plausible 
that  diffusion  acts  in  a  similar  way  to  that  deduced 
from  the  steady  flow  model  in  Section  2,  i.e.  diffusion 
K  acts  on  structures  in  the  tracer  field  that  slope  with 
aspect  ratio  a  and  the  result,  e.g.  in  achieving  a  balance 
between  diffusion  and  horizontal  straining,  is  that  it  is 
as  if  there  is  a  horizontal  diffusivity  with  magnitude 
Ka^.  This  is  supported  (with  certain  limitations)  by 
explicit  calculations  in  Vanneste  and  Haynes  (2001). 


Figure  2.  Scaling  of  (where  a  is  aspect  ratio  of  hori¬ 
zontal  to  vertical  scale)  with  parameters  F  (horizontal  strain 
rate),  A  (vertical  shear),  ah  (inverse  correlation  time  for  hor¬ 
izontal  strain)  and  a\  (inverse  correlation  time  for  vertical 
shear).  The  thin  dashed  lines  delimit  the  regions  where  the 
different  estimates  hold.  The  thick  dashed  lines  delimit  the 
regions  where  a  scales  as  the  aspect  ratio  of  the  velocity  field 
(A/F)  (lower  left-hand  region),  is  larger  than  that  aspect  ra¬ 
tio,  (right-hand  region)  and  is  smaller  than  that  aspect  ratio 
(upper  region). 

4.  Application  to  the  Ledwell  et  al, 
(1993)  observations 

In  the  tracer  release  experiment  reported  by  L93  the 
dispersion  of  a  tracer  in  the  ocean  thermocline  was 
followed  over  a  period  of  several  months.  The  verti¬ 
cal  (cross-isopycnal)  diffusion  could  be  estimated  di¬ 
rectly  and  a  value  of  vertical  diffusivity  of  about 
10'5ni2s-iwas  inferred.  In  the  horizontal  the  tracer 
was,  in  the  later  stages  of  the  experiment,  observed 
to  be  confined  to  thin  streaks,  whose  width  apparently 
reached  an  equilibrium  value  of  about  3  km.  This  was 
interpreted  as  the  stretching  out  of  the  tracer  patch  by 
the  mesoscale  eddy  field,  resulting  in  filaments  of  tracer 
whose  width  was  the  equilibrium  value  determined  by  a 
balance  between  horizontal  stretching  (which  tends  to 
reduce  the  width  of  the  filament)  and  horizontal  disper¬ 
sive  effects,  perhaps  associated  with  small-scale  mixing 
processes  (which  tend  to  increase  the  width  of  the  fila¬ 
ment).  Prom  the  observed  length  of  the  tracer  filament 
it  was  estimated  that  the  average  stretching  rate  expe¬ 
rienced  was  about  3  x  10-’’s-^  It  followed  that  if  the 
horizontal  dispersive  effects  could  be  represented  by  an 
effective  horizontal  diffusivity  «a,  it  must  have  a  value 
of  about  3  m2s~^(~  (3  km)®  x  3  x  10“'^s“^).  Thus 
Kh/Ky  ~  3  X  10®.  Previous  work  by  Young  et  al.  (1982), 
taking  account  of  the  combined  effect  of  vertical  mixing 
and  horizontal  advection  by  inertio-gravity  waves  had 
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suggested  kh/k^  ~  1500,  i.e.  200  times  too 

small  to  account  for  the  L93  observations. 

HA97  noted  that  the  combined  effect  of  horizontal 
stretching  and  vertical  shear  could  account  for  the  value 
of  KhfKy  if  the  aspect  ratio  a  was  about  500.  According 
to  the  analysis  presented  in  Section  3  this  requires  A  ~ 
1.5  X  10“^s”^,  based  on  the  estimate  a  A/S  (which 
holds  if  ^  S',  i.e.  the  correlation  time  for  the  vertical 
shear  must  be  larger  than  S”^).  This  value  of  vertical 
shear  (equivalent  to  15  cm  s~^  per  km)  does  not  seem 
indefensible  as  realistic. 

If  the  models  described  in  this  article  are  to  be  rel¬ 
evant  to  the  L93  observations,  one  might  ask  further 
whether  the  observed  morphology  of  the  tracer  is  consis¬ 
tent  with  the  model  predictions.  First  it  is  clear  that  in 
a  flow  with  finite  length  scales  the  exponential  stretch¬ 
ing  in  a  single  direction  predicted  by  the  model  in  Sec¬ 
tion  2  must  break  down  when  the  largest  length  scale  of 
the  tracer  patch  (or,  at  this  stage,  tracer  filaments)  be¬ 
comes  comparable  to  the  length  scale  of  the  flow.  \^at 
happens  is  simply  that  the  filaments  meander  on  the 
length  scale  of  the  flow  (as  observed  in  countless  labora¬ 
tory  and  numerical  experiments) .  Perhaps  more  crucial 
is  the  shape  of  tracer  filaments  in  cross-section.  The 
model  of  Section  2  predicts  that  this  will  ultimately  be 
highly  elongated,  so  that  the  tracer  is  actually  concen¬ 
trated  within  sloping  sheets.  However,  this  applies  on 
time  scales  much  greater  than  where  F  is  the  strain 
rate  and  therefore,  in  the  steady  model,  the  stretching 
rate.  For  times  comparable  to  F~^  (see  Figure  1),  the 
tracer  distribution  is  not  sheet-like,  but  it  is  simply  the 
case  that  the  filament  cross  section  has  much  larger  hor¬ 
izontal  extent  than  vertical  extent  (by  a  factor  a).  The 
observing  period  reported  by  L93  is  such  that  Ft  <  6, 
so  that  sheet-like  features  are  not  necessarily  expected. 
[Note  that  the  extent  of  the  sheets  is  proportional  to 
(Ft)V2.] 

5.  Possible  effects  of  double  diffusion 

Garrett  (1982)  (hereafter  G82)  noted  that  the  stir¬ 
ring  by  mesoscale  eddies  of  temperature  and  salinity 
along  isopycnal  surfaces  would  lead  to  large  local  gra¬ 
dients  in  these  quantities  manifested  as  thermohaline 
fronts.  These  in  turn  might  lead  to  double-diffusive  in¬ 
trusions.  G82  argued  that  the  sharpening  of  gradients 
of  temperature  and  salinity  by  stirring  would  be  halted 
by  the  mixing  effects  of  the  intrusions  when  the  growth 
rate  of  the  intrusions  was  equal  to  the  convergence  act¬ 
ing  on  the  fronts,  i.e.  to  the  stretching  rate  associated 
with  the  stirring  process.  He  gave  the  formula  (based 
on  previous  theoretical  and  experimental  work) 

Arrtax  =  0.075^  (21) 


for  the  maximum  growth  rate  \max  of  the  intru¬ 
sions,  where  g  is  the  gravitational  acceleration,  0  = 
P~^{.dp/dS)T  is  the  rate  of  change,  at  constant  tem¬ 
perature,  of  density  with  salinity,  and  Sx  is  the  salinity 
gradient  in  the  neighbourhood  of  the  front.  He  argued 
that  if  the  front  had  width  Lf  then  the  approximate 
value  of  Sx  would  be  Sx  =  SxL^ddyjLf  where  Sx  was 
the  large-scale  salinity  gradient  and  Leddy  was  the  typ¬ 
ical  size  of  the  mesoscale  eddies.  If  s  is  the  stretching 
rate  then  it  follows  that  Lf  is  given  by  the  formula 

X/ =  0.075^^;^^.  (22) 

For  the  values  of  s  and  Lf  observed  by  L93  (taking 
Lf  to  be  the  width  of  the  filaments)  and  assuming 
Leddy  ~  100  km,  this  would  require  g/3Sx/N  ~  10“’^s”^. 
This  is,  in  fact,  considerably  weaker  than  the  values  sug¬ 
gested  by  G82  as  examples.  This  therefore  appears  to 
leave  open  the  possibility  that  it  is  the  mixing  effects, 
along  isopycnal  surfaces,  of  double  diffusive  intrusions, 
rather  than  any  process  directly  related  to  ‘background’ 
vertical  mixing,  that  determines  the  width  of  the  fila¬ 
ments  observed  by  L93.  On  the  other  hand  it  is  not 
at  all  clear  that  the  filaments  of  a  tracer  injected  at  an 
arbitrary  location  will  match  the  regions  of  enhanced 
gradients  in  temperature  and  salinity  whose  distribu¬ 
tion  is  set  on  the  large  scale  and  therefore  feel  the  same 
mixing  effects, 

6.  Discussion 

The  work  reported  in  HA97  may  be  extended  in  var¬ 
ious  ways,  some  of  which  are  relevant  to  oceanographic 
considerations.  For  example,  Vanneste  and  Haynes 
(2001)  have  considered  the  effect  of  vertical  shear  on 
the  horizontal  wavenumber  spectrum  of  passive  tracers, 
in  particular  on  the  range  of  scales  where  the  effects  of 
diffusive  mixing  become  important.  In  particular  this 
work  highlights  the  limitations  of  estimating  the  effec¬ 
tive  horizontal  diffusivity  as  ko^.  The  recent  theoreti¬ 
cal  work  on  Batchelor-regime  turbulence  has  made  ex¬ 
plicit  predictions  about  probability  density  functions  for 
tracer  concentrations,  tracer  concentration  differences 
(over  a  finite  distance)  and  tracer  concentration  gradi¬ 
ents  and  this  work  needs  to  be  extended  to  the  layerwise 
two-dimensional  case  if  it  is  to  be  applicable  to  real  at¬ 
mospheric  and  oceanic  flows. 

Much  insight  into  atmospheric  tracer  distributions 
has  been  obtained  by  using  models  driven  by  observed 
velocity  fields,  or  velocity  fields  extracted  from  quasi- 
realistic  global  models.  Approaches  have  included  full 
numerical  integrations  of  the  tracer  evolution  equation 
(e.g.  on  an  isentropic  surface),  non-diffusive  reconstruc¬ 
tions  of  tracer  fields  by  following  back  trajectories  to  an 
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initial  condition  and  calculations  of  the  statistical  prop¬ 
erties  of  large  numbers  of  trajectories,  e.g.  to  give  the 
distribution  of  finite-time  stretching  rates.  See,  for  ex¬ 
ample,  the  papers  by  Schoeberl  and  Newman  (1995), 
Ngan  and  Shepherd  (1999)  and  Hu  and  Pierrehumbert 
(2001).  All  of  these  methods  have  substantial  savings 
over  integrations  of  a  full  model  including  tracer  equa¬ 
tions  and  dynamical  equations  and  some  may  be  worth 
applying  in  the  oceanic  context.  Velocity  fields  might 
be  extracted  from  eddy-resolving  models  for  this  pur¬ 
pose,  but  as  a  first  step  it  might  be  worth  considering 
‘synthetic’  velocity  fields  that  are  generated  quite  artifi¬ 
cially  to  have  a  plausibly  realistic  spatial  and  temporal 
structure  in  space  and  time. 
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Abstract.  Spiral  eddies  were  first  seen  in  the  sun  glitter  on  the  Apollo 
Mission  30  years  ago;  they  have  since  been  recorded  on  SAR  missions  and 
in  the  infr^ed.  The  spirals  are  globally  distributed,  10-25  km  in  size  and 
overwhelmingly  cyclonic.  They  have  not  been  explained.  Under  light  winds 
favorable  to  visualization,  linear  surface  features  with  high  surfactant  density 
and  low  surface  roughness  are  of  common  occurrence.  We  have  proposed  that 
frontal  formations  concentrate  the  ambient  shear  and  prevailing  surfactants. 
Horkontal  shear  instabilities  ensue  when  the  shear  becomes  comparable  to  the 
Coriolis  frequency.  The  resulting  vortices  wind  the  linear  features  into  spirals. 
The  hypothesis  needs  to  be  tested  by  prolonged  measurements  and  surface 
truth.  Spiral  eddies  are  a  manifestation  of  a  sub-mesoscale  oceanography 
associated  with  upper  ocean  stirring;  dimensional  considerations  suggest  a 
horizontal  diffusivity  of  order  10^m^s“^. 


Introduction 

The  first  photographs  of  spiral  eddies  appears  to 
have  been  taken  on  Apollo-Satum  in  October  1968.  In 
the  late  70s  SEASAT  with  its  synthetic  aperture  radar 
(SAR)  confirmed  the  early  discoveries  from  crewed 
space  flights  {Stevenson  1998,  1999).  But  most  of  the 
existing  material  was  collected  by  Paul  ScuUy-Power 
(the  first  and  so  far  only  oceanographer-astronaut)  on 
5-13  October  1984:  ‘Tar  and  away  the  most  impres¬ 
sive  discovery...  is  that  of  the  submesoscale  ocean  (less 
than  100  km)  is  far  more  complex  dynamically  than 
ever  imagined...  Patterns  of  this  complexity  could  be 
seen  to  be  interconnected  for  hundreds  and  hundreds  of 
kilometers”  {Scully-Power  1986). 

The  spiral  pattern  whose  global  distribution  was  re¬ 
ported  by  Sculiy-Power  is  at  an  awkward  scale,  virtually 
impossible  to  recognize  from  shipboard,  and  too  large 
to  be  encompassed  even  from  high-fl3dng  aircraft.  Dis¬ 
covery  had  to  await  space  missions. 

Spiral  Images  in  the  Sun  Glitter  and  in 
SAR 

Figures  1  and  2  show  a  visual  and  SAR  image,  re¬ 
spectively,  of  spiral  streak  patterns.  Spirals  are  globally 
distributed  (Figure  3).  Typical  spiral  dimensions  are 
from  10  to  20  km,  with  streaks  50  to  100  m  wide.  Spi¬ 
rals  are  overwhelmingly  cyclonic,  woxmd  anti-clockwise 
(viewed  from  above)  in  the  northern  hemisphere,  clock¬ 
wise  in  the  southern  hemisphere.  Ship  wakes  cross¬ 
ing  the  streaks  (not  shown)  have  a  cyclonic  offset  with 


Figure  1.  A  pair  of  interconnected  spirals  in  the 
Mediterranean  Sea  south  of  Crete.  This  vortex  pair 
has  a  clearly  visible  stagnation  point  between  the  two 
spirals,  the  cores  of  which  are  aligned  with  the  precon¬ 
ditioning  wind  field.  7  October  1984. 

shears  up  to  10”^s”‘^  We  need  to  refer  to  Munk  et  al 
(2000)  (henceforth  MAFZ)  for  a  more  representative  se¬ 
lection  (13  images  out  of  400  collected). 

The  observational  material  poses  three  questions: 

(A)  How  are  the  spirals  wound? 

(B)  How  is  symmetry  broken  in  favour  of  cyclonic 
rotation? 

(C)  What  makes  spirals  visible? 

On  SAR  images  the  streaks  are  always  dark,  indicat- 
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Figure  2.  Spirals  in  the  Mediterranean  Sea  visual¬ 
ized  with  Shuttle  XSAR.  The  streaks  are  differentially 
smooth.  9  October  1994. 
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Figure  3.  Distribution  of  spiral  eddies  from  Scully- 
Power  (1986)  visual  observations  and  our  collection  of 
400  images.  l?he  13  numbered  locations  refer  to  MAFZ, 


Figure  4.  Rossby  number  in  the  upper  250  m  sampled 
at  3  km  spacing  along  140®  W  from  25®  N  to  35°  N  in 
the  North  Pacific  {Rudnick  and  Ferrari  1999). 


ing  a  reduced  scattering  cross-section,  e.g.  differentially 
smooth  water.  Natural  biogenic  surface  films  are  or¬ 
ganized  by  near-surface  convergence  into  linear  streaks 
with  over  40%  surfactant  coverage  at  low  winds.  The 
concentration  is  associated  with  nearly  inextensible  sur¬ 
face  films  which  dissipate  capillaries  and  short  gravity 
waves.  The  film  thickness  required  to  dampen  the  short 
waves  is  only  0.01  to  0.1  //m.  On  the  optical  images  the 
smooth  streaks  are  bright  in  the  inner  sun  glitter  (which 
requires  low  rms  slopes  for  reflection  of  the  sun  into  the 
camera)  and  dark  in  the  outer  glitter.  The  situation 
is  complex  and  not  well  understood,  and  we  refer  to 
MAFZ  (1225-30,  1236-7)  for  further  discussion. 

But  evidently  the  third  question  can  be  restated  as 
follows:  What  is  the  circulation  pattern  that  collects  the 
surfactant  material  into  streaks  (which  are  subsequently 
wound  into  a  spiral  pattern)?  Multiple  stripes  at  kilo¬ 
meter  spacing  presumably  are  associated  with  helical 
circulation  rolls  in  the  atmospheric  boundary  layer.  In 
addition,  frontal  instabilities  can  concentrate  and  dis¬ 
tort  the  surfactant,  as  we  shall  see. 

Ambient  Ocean  Vorticity 

Measurement  of  surface  velocity  shear  du/dy  along 
1000  km  of  roughly  northward  track  in  the  North  Pa¬ 
cific  {Rudnick  and  Ferrari  1999)  indicate  values  of  order 
i0“^s”^.  The  situation  is  conveniently  portrayed  by  a 
distribution  of  Rossby  Numbers 

Ro  =  C//  (1) 

where  Q  =  dvjdx  -  du/dy  is  the  vertical  component  of 
vorticity  (cyclonic  is  positive)  and  /  is  the  Coriolis  fre¬ 
quency  (Figure  4).  The  distribution  is  symmetric,  with 
very  few  values  exceeding  There  are  a  few  outliers 
showing  a  slight  preference  of  cyclonic  vorticity  for  large 
|Roj,  and  this  has  since  been  confirmed  {Rudnick^  tlfis 
volume). 

The  above  paper  also  shows  that  the  shear  is  dis¬ 
tributed  over  a  broad  band  of  scales,  from  kilometers  to 
himdreds  of  kilometers. 

Horizontal  Shear  Instability 

Starting  from  parallel  shear  flow  with  an  inflection 
point,  Figure  5  shows  a  numerical  simulation  of  the 
development  of  the  most  unstable  mode  {Corcos  and 
Sherman  1976,  1984).  Time  is  in  units  of  the  initial 
reciprocal  shear  at  the  stagnation  point  The  numerical 
experiment  was  intended  to  model  a  vertical  shear  flow, 
but  may  as  well  be  interpreted  in  terms  of  a  horizontal 
shear  flow.  There  is  no  implication  of  the  sense  of  ro¬ 
tation;  in  fact  we  have  reversed  the  published  drawing 
from  anticyclonic  to  cyclonic  rotation. 
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readily  shown  that  vorticity  is  conserved: 

D(:/Dt  =  Q,  c  =  -v2$ 
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Figure  5.  Computer  simulations  of  a  developing  shear 
instability  {Corcos  and  Sherman  1984).  The  four  pan¬ 
els  show  the  streamlines  at  times  0.5,  1.0,  1.5,  2.0  (in 
umts  of  the  initial  reciprocal  shear).  Heavy  line  is  the 
cat’s-eye”  streamline  through  the  stagnation  points. 
The  dots  represent  particle  positions  initially  placed  on 
the  interface;  they  are  initially  crowded  near  the  two 
stagnation  points  to  allow  for  a  subsequent  large  strain. 
The  model  allows  for  diffusion  and  viscosity.  We  have 
reversed  the  original  figure  from  anticyclonic  to  cyclonic 
rotation. 

Streamlines  show  the  development  of  Kelvin’s  cele¬ 
brated  “cat’s-eye”  solution.  Particles  inserted  along  the 
interface  exhibit  the  growth  of  a  spiral. 

It  is  instructive  to  compare  this  nmnerical  experi¬ 
ment  with  StuaH’s  (1967)  steady-state  solution  for  a 
non-rotating  incompressible  equal-density  fluid  (Figure 
6).  The  streamlines  are  given  by 

’^(x,y)  =  —k~^U\og{co&\Lky  —  acoskx).  (2) 

For  a  =  0  we  have  u  =  -d-<i/dy  =  -C/tanh  fcy.  It  is 


(3) 

where  D/Dt  is  the  substantial  derivative  (followring  a 
particle).  The  circulation  F  =  f  C  dx  dy  =  AnU  k~^  is 
the  same  for  all  four  panels.  The  streamlines  through 
the  two  stagnation  points  separate  the  “core  circula¬ 
tion”  from  an  exterior  circulation  which  is  not  all  that 
different  from  the  case  a  =  0.  The  fractional  core  cir¬ 
culation 

Fc/F  =  47r~^tan~^  y/a  (4) 

increases  wdth  increasing  o:.  The  four  panels  give  in¬ 
dependent  steady-state  solutions  for  different  values  of 
the  independent  parameter  a,  but  we  are  tempted,  by 
comparison  of  the  two  figures,  to  interpret  the  Stuart  so¬ 
lution  as  a  developing  instability  for  a  slowly  increasing 
a(t).  In  MAFZ  we  demonstrate  that  an  initial  particle 
injection  produces  mature  spirals,  as  in  the  Corcos  and 
Sherman  numerical  experiment,  some  of  them  resem¬ 
bling  the  vortex  pair  in  the  Cretan  Sea  (Figure  1).  The 
proposed  response  to  question  A  is  that  the  spirals  are 
a  mamfestation  of  a  horizontal  shear  instability. 

There  is  as  yet  no  breaking  of  symmetry.  In  fact, 
replacing  D^jDt  =  0  by 

D(C  +  /)/Ut  =  0  (5) 

does  not  change  the  developing  streamline  pattern,  so 
that  planetary  vorticity  is  conserved  in  the  /-plane. 
But  the  developing  pressure  patterns  are  fundamentally 
modified  {^MAZD  figure  22).  Starting  with  the  Stuart 
solution  of  a  low  pressure  in  the  core  (regardless  of  the 
sense  of  rotation)  and  gradually  turning  on  the  Corio¬ 
lis  parameter  /,  the  core  pressure  remains  a  LOW  for 
the  cyclonic  case  but  switches  to  a  HIGH  for  the  an¬ 
ticyclonic  case.  Clearly  there  is  here  the  dynamics  for 
breaking  synunetry. 

Breaking  Symmetry:  the 
Hoskins-Bretherton  Jet 

Hoskins  and  Bretherton  (1972)  have  solved  a  prob¬ 
lem  of  frontogenesis  with  conservation  of  density  and 
potentied  vorticity, 

D{p,  q)/Dt  =  0,  p  9  =  (/  +  C)  •  Vp  (6) 

where  D/Dt  is  the  substantial  derivative.  The  start¬ 
ing  point  is  a  vertically  mixed  layer  with  a  horizontal 
density  transition  from  warm  and  light  in  the  south 
(say)  to  cold  and  heavy  in  the  north  (Figure  7,  left). 
The  initial  density  gradient  develops  into  an  eastward 
“thermal  wind”,  as  shown.  A  deformation  field  7  = 
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Figure  6.  Stuari^s  (1967)  solution  portraying  a  steady 
flow  conserving  vorticity.  Streamlines  designate  both 
dimensional  streamlines  ^  —  log  g,  g  =  cosh  y  — 

acosx),  and  lines  of  constant  vorticity  (  =  — 

with  values  indicated  in  each  panel  to  the  left  and 
right,  respectively.  For  a  =  0  the  velocity  profile  is 
u  =  —  tanhy.  Stagnation  points  at  x  =  rb7r,  y  —  0 
are  connected  by  the  stagnation  stream  lines  separat¬ 
ing  the  closed  “core”  circulation  from  the  outer  flow. 
With  increasing  a  a  larger  fraction  of  the  circulation  is 
concentrated  within  the  core. 


at  2.89  7“^  the  density  gradient  at  the  left  flank  de¬ 
velops  Ro*^  =  00,  while  anticyclonic  shear  remains  at 
Ro“  =  —0.3, 

The  crucial  point  is  that  starting  at  a  time  when  Ro"^ 
is  of  order  +1  the  cyclonic  shear  zone  becomes  a  breed¬ 
ing  ground  for  spiral  eddies  long  before  appreciable  an¬ 
ticyclonic  vorticity  has  been  generated.  In  Figure  4  the 
third  panel  has  been  emphasized  because  at  this  time 
the  vertical  shear  at  the  surface  at  the  front  reaches  a 
value  of  du/ dz  =  2N  {N  is  the  buoyancy  frequency)  cor¬ 
responding  to  a  Richardson  number  {N/{du/dz))^  =  | 
and  suggesting  the  onset  of  vertical  shear  instability. 

An  independent  consideration  has  to  do  with  the  vis¬ 
ibility  of  the  spiral  arms,  presumably  the  result  of  the 
alignment  and  concentration  of  surfactants.  Consider 
an  elementary  surface  area  6x6y  at  time  zero.  With  the 
developing  front  the  area  is  elongated  along  the  x-axis 
on  both  flanks  of  the  developing  jet.  But  in  accordance 
with  the  Hoskins  and  Bretherton  theory,  at  the  time 
2.75  7“^  the  area  has  expanded  (by  a  factor  |)  on  the 
anticyclonic  side,  while  it  has  contracted  (to  \  the  orig¬ 
inal  area)  on  the  cyclonic  side.  Thus  the  frontal  theory 
has  the  elements  to  account  for  both  the  visibility  and 
sense  of  rotation  of  the  spiral  eddies.  But  when  exam¬ 
ined  in  detail  the  story  is  not  as  clear-cut  as  presented 
here,  and  we  must  refer  to  MAFZ  for  a  more  complete 
discussion. 

Breaking  Symmetry:  the  Margules 
Front 


du/dx  -  dv/dy  is  superposed,  causing  the  initially  ver¬ 
tical  isopycnals  to  tilt  northward.  Up  to  this  point  there 
has  been  no  breaking  of  symmetry,  ail  directions  can  be 
reversed. 

In  the  subsequent  development  we  use  “north”  and 
“east”  only  for  convenient  reference  to  the  figure.  The 
northward  tilting  of  isopycnals  is  not  uniform;  the 
northern  isopycnals  converge  at  the  surface,  and  the 
southern  isopycnals  diverge.  Accordingly  the  associated 
eastward  thermal  jet  has  a  strong  cyclonic  shear  at  its 
northern  (left)  flank  and  a  weak  anticyclonic  shear  at  its 
southern  (right)  flank.  At  time  2.5  (measured  in  7“^ 
units)  the  associated  Rossby  numbers  are  Ro”^  =  -hi 
and  Ro"  =  “0.3,  respectively. 

At  this  stage  the  underl3dng  rate  of  strain  no  longer 
determines  the  rate  of  development.  Rather,  the  isopy- 
cnal  ’’collapse”  takes  the  form  of  a  “Rossby  Adjust¬ 
ment  Problem”  with  taking  the  place 

of  7~^  «  10“^s"'^  as  the  relevant  time  scale  {Ou  1984). 
In  the  short  time  interval  between  2.5  to  2.75  7”^  the 
cyclonic  shear  grows  from  Ro*^  =  1  to  Ro"^  =  3,  and 


In  the  traditional  Margules  treatment  of  a  sharp 
front,  anticyclonic  fronts  are  ruled  out  by  the  condi¬ 
tion  of  static  stability.  For  a  “softened  Margules  front” 
{MAFZ)  there  is  a  gravitational  stability  limit  to  the 
anticyclonic  shear,  but  no  such  limit  to  the  cyclonic 
case. 

Breaking  Symmetry:  Anticyclonic 
Instabilities 

To  further  confuse  the  issue,  there  is  another  inde¬ 
pendent  set  of  processes  to  explain  the  dominance  of 
cyclonic  vortices.  It  follows  from  the  Rayleigh  crite¬ 
rion  of  stability  (extended  to  include  Coriolos  accel¬ 
eration)  that  cyclonic  circular  vortices  are  stable  and 
anticyclonic  vortices  are  unstable  {MAFZ  6.21),  and 
this  leads  to  an  “inertial  instability”  criterion  Ro  <  -1 
which  goes  back  to  Pedley  (1969).  Oceanographers  are 
familiar  with  the  vertical  shear  (]Wchardson)  instability 
for  du/dz  >  2N  but  surprisingly  unfamiliar  with  the 
horizontal  shear  (Pedley)  instability  for  du/dy  >  /. 

There  is  ample  numerical  evidence  for  instabilities 
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Figure  7.  Cartoon  for  the  generation  of  ocean  spirals 
(see  text). 


that  impede  the  development  of  anticyclonic  vortices 
but  have  no  such  effect  on  cyclonic  vortices.  {Lesieur  et 
al.j  1991,  Potylitsin  and  Peltier  1998).  The  distinction 
is  equally  clear  in  laboratory  experiments  {Bidokhti  and 
Tritton  1992).  Kloosterziel  (1990)  observed  that  “it  is 
virtually  impossible  to  create  anticyclonic  vortices  ...by 
simply  stirring  the  fluid  locally.  But  cyclonic  vortices 
are  easily  created  in  this  way!  In  the  anticyclonic  case 
stirring  ....  leads  to  turbulent  motion  and  the  genera¬ 
tion  of  waves  whereas  by  cyclonic  stirring  a  well-defined 
smooth  vortex  forms.” 

With  regard  to  question  B,  it  is  unresolved  whether 
the  dominance  of  cyclonic  vortices  is  associated  with 
a  dominance  in  cyclonic  horizontal  shear  early  in  the 
formation  process,  or  with  the  relative  instability  of  an¬ 
ticyclonic  vortices  in  the  mature  stage. 


Stirring  and  Mixing 

The  proposed  formation  of  spiral  eddies  is  cartooned 
in  Figure  7.  The  geostrophically  balanced  ambient 
ocean  vorticity  of  order  ±10"^/  is  enhanced  by  local 
frontogenic  processes,  with  concentration  of  surfactant 
along  a  converging  line.  When  the  frontal  shear  be¬ 
comes  comparable  to  f,  instabilities  lead  to  cross-frontal 
flow  accompanied  by  a  cat’s-eye  circulation  pattern. 
The  cat’s-eye  circulation  twists  the  convergence  line  and 
neighboring  linear  features  into  a  cyclonic  spiral  which 
stretches  and  further  thins  the  lines  of  surfactant  con¬ 
centration.  It  is  a  text-book  example  of  horizontal  stir¬ 
ring. 

We  have  not  made  any  serious  attempt  at  estimat¬ 
ing  the  relevant  diffusivity.  The  best  we  can  do  is  a 
dimensional  argument  for  «;  =  ci'u'  with  c  =  |  (typical 
for  turbulent  shear  flows,  Tennekes  and  Lumley  1972). 
Taking  the  minor  axis  of  a  Stuart  core  as  the  charac¬ 
teristic  length  Z',  the  maximum  velocity  along  the  stag¬ 
nation  streamline  for  u'  and  a  =  1  for  a  mature  20  km 
spiral  yields  k  =  lO^m^  s”^  {MAFZ). 


Testing  the  Hypothesis 

Our  hypothesis  is  based  on  observational  material 
which  consists  almost  entirely  of  unrelated  glimpses  in 
a;,y-space  on  the  sea  surface.  For  a  satellite  in  a  low 
earth  orbit  (LEO)  a  given  point  remains  within  view 
for  only  about  6  s.  What  is  required  here  are  prolonged 
stares  or  frequent  repeat  visits  coordinated  with  ship¬ 
board  observations  (Figure  8).  We  cannot  think  of  any 
a;,y,  ocean  processes  that  had  been  properly  iden¬ 
tified  from  measurements  in  half  the  coordinate  space. 
We  must  assume  that  there  are  serious  flaws  in  the  fore¬ 
going  presentation. 


Figure  8.  The  proposed  experiment.  SAR  imagery 
from  an  overhead  drone  is  examined  by  the  authors  on 
shipboard  in  real  time.  The  image  shows  the  position 
of  the  vessel  which  is  about  to  enter  a  spiral  streak. 

Following  his  41-G  space  mission  in  October  1984. 
Scully- Power  (1986)  wrote:  “The  almost  ubiquitous  oc¬ 
currence  (of  spiral  eddies),  whenever  submesoscale  dy¬ 
namics  was  revealed  in  the  sun  glitter,  indicates  that 
they  are  perhaps  the  most  fundamental  entity  in  ocean 
dynamics  at  this  scale.  The  difficulty  is  in  explaining 
their  structure.”  The  only  serious  attempt  at  analysis 
has  been  in  a  Norwegian  Doctoral  dissertation  which  ex¬ 
plores  baroclinic  instabilities  in  a  narrow  cyclonic  shear 
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zone  {Eldevik  and  Dysthe  1999).  Why  has  the  problem 
received  so  little  attention  in  the  thirty  years  since  dis¬ 
covery?  We  assert  that  the  fashion  during  these  years 
has  been  statistical  rather  than  phenomenological  de¬ 
scriptions  of  ocean  features,  and  here  we  are  concerned 
with  a  truly  phenomenological  problem. 
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Abstract.  The  last  decade  has  seen  a  tremendous  increase  in  the  number  and 
quality  of  observations  of  horizontal  structure  in  the  upper  ocean.  Many  of  these 
data  have  been  made  possible  by  the  widespread  use  of  towed  vehicles  and  ship¬ 
board  Doppler  sonars.  We  review  what  these  observations  have  revealed  about 
processes  in  the  upper  ocean,  with  special  emphasis  on  mid-latitudes  and  hori¬ 
zontal  scales  of  1-100  km.  Fronts  are  one  of  the  most  prominent  features  of  the 
mixed  layer  on  these  scales.  The  vertical/across-ffont  circulation  created  at  a 
front  may  be  quantified  assuming  quasigeostrophic  dynamics.  Typical  of  the 
mixed  layer  are  fronts  that  are  warm  and  salty  on  one  side,  and  cool  and  fresh  on 
the  other  such  that  the  density  contrast  across  the  front  is  small.  The  ubiquitous 
existence  of  such  compensated  fronts  suggests  that  horizontal  mixing  in  the 
mixed  layer  is  an  increasing  function  of  the  horizontal  density  gradient.  Argu¬ 
ments  based  on  the  conservation  of  potential  vorticity  indicate  that  10-km-scale 
(roughly  the  Rossby  radius  of  deformation)  eddies  in  the  mixed  layer  should  be 
predominantly  cyclonic.  Recent  observations  of  mixed-layer  vorticity  are  posi¬ 
tively  skewed. 


Introduction 

The  database  of  small  horizontal  scale  upper  ocean 
observations  has  grown  significantly  in  recent  years  due  to 
the  increased  use  of  towed  vehicles  (such  as  SeaSoar), 
shipboard  Acoustic  Doppler  Current  Profiler  (ADCP)  and 
Global  Positioning  System  (GPS)  navigation.  A  brief  re¬ 
view  of  these  measurements  is  the  purpose  of  this  paper, 
with  special  emphasis  on  the  mid-latitude  open  ocean  on 
scales  of  1-100  km. 

Relevant  characteristics  of  the  instruments  used  in  upper 
ocean  surveying  are  as  follows.  SeaSoar  is  a  towed  vehicle 
capable  of  cycling  from  the  surface  to  over  300  m  every  3- 
4  km  at  a  tow  speed  of  4  m  s  '.  SeaSoar  usually  carries  a 
CTD,  providing  measurements  of  temperature,  salinity, 
and  pressure.  A  150-kHz  ADCP  in  typical  use  allows  hori¬ 
zontal  resolution  of  velocity  quite  similar  to  that  of  a  Sea¬ 
Soar,  with  vertical  coverage  of  200-300  m.  High  quality 
GPS  navigation  is  a  key  part  of  the  surveying  system,  es¬ 
pecially  for  retrieving  the  best  absolute  velocity.  P-code 
receivers,  in  use  on  many  research  vessels,  offer  positional 
accuracy  of  about  3  m. 

Observations  are  used  to  address  three  issues.  (1)  The 
anatomy  of  an  upper  ocean  front  is  discussed,  including 
the  vertical/across-ffont  circulation  and  associated  heat 
flux.  (2)  Compensation  of  horizontal  temperature  and  sa¬ 
linity  gradients  in  the  mixed  layer  is  shown  common.  (3) 


Cyclonic  vorticity  is  preferred  at  small  horizontal  scales  in 
the  mixed  layer.  For  each  of  the  above  issues,  we  discuss 
what  the  observations  suggest  about  mixing  in  the  ocean. 

The  anatomy  of  an  upper  ocean  front 

The  most  common  use  of  the  SeaSoar/ADCP/GPS  sys¬ 
tem  has  been  to  survey  individual  features  in  the  upper 
ocean.  Such  a  survey  of  the  Azores  Front  in  the  North  At¬ 
lantic,  done  in  March  1992,  demonstrates  many  phenom¬ 
ena  common  to  oceanic  fronts  [Rudnick  and  Luyten,  1996]. 
The  survey,  consisting  of  a  series  of  north/south  sections 
separated  by  one-quarter  of  a  degree  in  longitude,  reveals  a 
sharp  density  front  (Figure  1).  Horizontal  velocity  ap¬ 
proaches  0.5  m  s  ',  and  is  generally  directed  parallel  to 
isopycnals.  This  intuitive  result  has  two  implications.  First, 
surveying  techniques  are  accurate  enough  that  coherent 
velocity  and  density  fields  can  be  measured.  Second,  and 
more  importantly,  the  rate  of  change  and  vertical  advection 
of  density  must  be  small.  The  velocity  field  is  in  the  form 
of  a  jet  strongest  at  the  density  front.  The  jet,  and  its  asso¬ 
ciated  straining,  causes  streamers  in  the  alongffont  direc¬ 
tion.  Enhanced  horizontal  variability  is  typical  of  frontal 
regions  [Ferrari  and  Rudnick,  2000]. 
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Figure  1.  Potential  density  and  horizontal  velocity  in  the  mixed 
layer  at  the  Azores  Front.  The  contour  interval  is  0.025  kg  m‘^ 
and  the  velocity  scale  is  shown.  Note  the  jet  directed  along  the 
density  fi'ont. 


The  vertical/across-front  structure  is  seen  in  a 
north/south  section  along  24.25°W  (Figure  2).  Mixed-layer 
depth  trends  from  about  100  m  deep  in  the  northern  part  of 
the  section  to  roughly  150  m  in  the  south.  The  mixed-layer 
base  is  marked  as  a  region  of  strong  stratification.  The 
front  is  apparent  as  isopycnals  slope  upward  and  to  the 
north  across  the  mixed  layer.  One  can  think  of  the  structure 
almost  as  a  piece  of  the  mixed-layer  base  peeling  off  and 
intersecting  the  surface.  Because  the  survey  is  from  late 
winter,  following  a  period  of  relative  calm,  restratification 
is  just  beginning.  A  shallow  stratified  layer  lies  over  the 
mixed  layer  north  of  the  front.  Isopycnals  retain  a  strong 
slope  in  the  thermocline  beneath  the  surface  front.  The 
Azores  Front  is,  in  fact,  a  permanent  feature  that  penetrates 
to  a  depth  of  2000  m  [Gould,  1985].  There  is  some  sugges¬ 
tion  of  subduction,  as  mixed-layer  water  from  the  north 
side  of  the  front  appears  to  slide  southward  and  below  the 
outcropping  front. 

The  simplest  dynamical  balance  one  can  imagine  testing 
is  thermal  wind.  Such  a  test  is  possible  by  comparing  the 
independently  measured  fields  of  velocity  shear  and  den¬ 
sity  (Figure  2).  Strong  shear  of  the  correct  sign  is  apparent 
along  the  outcropping  isopycnals  in  the  mixed  layer,  and 
downward  and  southward  beneath  the  mixed  layer.  Alter¬ 
nating  bands  of  positive  and  negative  shear  are  apparent  at 
the  mixed  layer  base  north  of  the  front.  These  shears  are 


clearly  out  of  thermal  wind  balance,  and  are  likely  near- 
inertial. 


Lalitude 


Figure  2.  North/south  section  of  potential  density  and  vertical 
shear  of  velocity.  The  contour  interval  of  density  is  0.025  kg  nv\ 
Note  the  high  shear  along  the  sloping  front. 


The  vertical  velocity  field  may  be  inferred  assuming 
quasi-geostrophic  dynamics.  This  assumption  is  reasonable 
because  the  front  is  observed  to  be  nearly  in  thermal  wind 
balance.  Derivation  of  the  resulting  omega  equation 
[Hoskins  et  ai,  1978]  relies  on  the  elimination  of  the  time 
derivative  terms  in  the  momentum  and  density  equations. 
The  omega  equation  can  be  written 

V=(A/-vr)+/-|^  =  2VQ,  (1) 

UZ 


where 


Q  = 


Po 


„  du„  „ 
-^■Vp.^Vp 


dx 


dv 


(2) 


and  u^,  is  geostrophic  velocity,  the  divergence  operator  is 
two-dimensional  and  other  symbols  have  their  usual  mean¬ 
ing.  Forcing  of  the  omega  equation  is  a  function  of  the 
density  and  geostrophic  velocity  fields. 

The  elliptic  omega  equation  is  solved  for  the  survey  of 
the  Azores  Front  [Rudnick,  1996].  A  band  of  downwelling 
exists  near  and  to  the  north  of  the  front  (Figure  3).  Upwell- 
ing  appears  on  the  south  side  of  the  front  especially  at 
about  24°W.  A  region  of  strong  downwelling  occurs  in  the 
SW  comer  where  the  flow  is  southward.  The  tendency  for 
downwelling  on  the  dense  (cold)  side  of  the  front  and  up- 
welling  on  the  less  dense  (warm)  side  results  in  a  net  heat 
flux  averaged  over  the  horizontal  area  of  the  survey.  The 
heat  flux  peaks  at  the  base  of  the  mixed  layer  at  about  15 
W  m'^  This  heat  flux  is  comparable  to,  or  larger  than,  cli¬ 
matological  values  compiled  on  1°  grids  [Marshall  et  al, 
1993]. 
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Figure  3.  Vertical  velocity  and  potential  density  in  the  mixed 
layer  at  the  Azores  Front.  The  contour  interval  for  potential  den¬ 
sity  is  0.025  kg  m'^.  The  white  contour  indicates  zero  vertical 
velocity.  Note  the  band  of  downwelling  in  and  to  the  north  of  the 
front. 

A  simple  theory  of  a  semi-geostrophic  front  [Hoskins 
and  Bretherton,  1972]  helps  to  understand  the  observa¬ 
tions.  The  two-dimensional  model  relies  on  the  assumption 
that  the  across-front  momentum  balance  is  exactly 
geostrophic,  but  the  alongfront  balance  includes  horizontal 
and  vertical  advection  (see  Appendix).  The  front  is  formed 
by  a  large-scale  deformation  of  an  initially  vertically  uni¬ 
form  layer  with  weak  horizontal  density  gradients.  The 
solution  (Figure  4)  is  shown  at  time  4/a,  where  a  is  the 
rate  of  strain.  The  large-scale  strain  estimated  from  the 
velocity  field  of  Figure  1  is  approximately  10'^  s'',  so  the 
dimensional  time  would  be  46  days.  Sloping  isopycnals 


Figure  4.  The  semi-geostrophic  model  of  frontogenesis  at  time 
4/a  Black  lines  are  isopycnals  with  a  contour  interval  of  0.2  kg 
m’^  Lines  of  constant  along-ffont  velocity  (blue)  have  an  inter¬ 
val  of  0.1  m  s''.  The  red  lines  show  the  positions  of  particles  ini¬ 
tially  at  10  and  50  m. 
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are  pinched  at  the  surface  forming  a  front.  An  essential 
feature  of  this  model  is  that  an  infinitely  sharp  front  is 
formed  in  a  finite  time.  Alongfront  velocity  is  in  the  form 
of  a  surface  intensified  jet  with  the  dense  (positive  vortic- 
ity)  side  sharper  than  the  light  (negative  vorticity)  side.  As 
the  solution  is  Lagrangian,  it  is  a  simple  matter  to  deter¬ 
mine  the  position  of  particles.  The  initial  10  and  50  m  sur¬ 
faces  have  been  stirred  to  be  multi-valued  as  a  function  of 
horizontal  position.  Strong  localized  downwelling  occurs 
on  the  dense  side  of  the  front  with  weaker,  more  diffuse, 
upwelling  on  the  less  dense  side.  This  pattern  of  vertical 
velocity  is  consistent  with  the  observations. 

Temperature/salinity  compensation 

Mixed-layer  thermohaline  fronts  arc  often  observed  to 
have  a  gradient  from  warm  and  salty  to  cool  and  fresh  such 
that  the  density  contrast  across  the  front  is  small.  This  phe¬ 
nomenon  is  termed  compensation  because  temperature  and 
salinity  compensate  in  their  effects  on  density.  Compensa¬ 
tion  has  been  known  for  some  time  at  large  and  meso 
scales  [Roden,  1975;  Roden,  1984].  More  recently,  com¬ 
pensation  has  been  shown  to  exist  at  horizontal  scales  as 
small  as  10  m  [Ferrari  and  Rudnick,  2000;  Rudnick  and 
Ferrari,  1999]. 

The  existence  of  compensation  in  the  mixed  layer  is 
consistent  with  theory  proposed  by  Young  [1994]  and  Fer¬ 
rari  and  Young  [1997].  Consider  a  vertically  mixed  layer 
with  an  initially  random  distribution  of  temperature  and 
salinity.  Temperature  and  salinity  gradients  compensate  in 
their  effect  on  density  in  some  regions,  while  in  others 
density  gradients  exist.  Density  gradients  slump  due  to 
gravity,  creating  sloping  isopycnals.  Vertical  mixing  re¬ 
sults  in  weakened  horizontal  density  gradients.  This  proc¬ 
ess  is  essentially  shear  dispersion  [Taylor,  1953]  where  the 
shear  is  due  to  slumping  density  gradients,  and  the  mixing 
is  caused  by  any  of  the  processes  that  make  the  mixed 
layer  vertically  uniform.  Temperature/salinity  gradients 
that  are  compensated  do  not  slump  and  therefore  do  not 
experience  shear  dispersion.  The  net  effect  is  that  density 
gradients  diffuse  while  compensated  gradients  persist. 

Compensation  is  manifest  as  a  strong  correlation  be¬ 
tween  temperature  and  salinity  in  a  horizontal  SeaSoar  tow 
in  the  N.  Pacific  winter  mixed  layer  [Ferrari  et  ai,  this 
volume].  Temperature  and  salinity  fluctuations  coincide  at 
all  resolved  scales.  The  observations  are  thus  consistent 
with  a  horizontal  diffusivity  that  is  an  increasing  function 
of  horizontal  density  gradient. 

An  important  question  is  under  what  conditions  is  com¬ 
pensation  typical  of  the  world’s  oceans.  This  question  is 
addressed  using  SeaSoar  data  from  the  midlatitude,  north¬ 
ern  hemisphere  Pacific,  Atlantic,  and  Indian  Oceans  during 
all  seasons  [Rudnick  and  Martin,  2001].  The  relevant 
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quantity  is  the  density  ratio  /?,  defined  as  the  ratio  of  the 
effect  of  a  horizontal  change  of  temperature  on  density 
divided  by  that  of  salinity  on  density.  For  a  compensated 
thermohaline  front,  /?=!. 


Figure  5.  The  conditional  probability  density  funetion  (pdf)  of 
mixed-layer  density  ratio  as  a  function  of  mixed-layer  depth  from 
the  global  ocean.  Each  row  of  bars  represents  the  conditional  pdf 
for  the  given  mixed-layer  depth.  Height,  and  color  shading,  of  the 
bars  indicate  the  pdf  magnitude.  The  density  ratio  is  typically  1 
for  mixed  layers  deeper  than  75  m. 


The  conditional  probability  density  function  (pdf)  of 
mixed-layer  density  ratio  as  a  function  of  mixed-layer 
depth  summarizes  results  from  the  global  ocean  (Figure  5). 
For  mixed-layers  deeper  than  75  m  a  density  ratio  near  1  is 
typical,  while  shallower  mixed  layers  have  a  poorly  de¬ 
fined  density  ratio.  The  shallowest  mixed  layers  are  tem¬ 
perature  dominated  as  the  density  ratio  approaches  infinity. 
Mixed-layer  depth  may  be  considered  a  proxy  for  the 
strength  of  vertical  mixing.  The  results  suggest  that  verti¬ 
cal  mixing  is  essential  to  the  establishment  of  compensa¬ 
tion  in  the  mixed  layer.  This  observation  is  consistent  with 
the  slumping  and  mixing  model  where  vertical  mixing  is 
an  essential  process.  On  the  other  hand,  compensation 
should  not  be  expected  in  shallow  mixed  layers  where  ver¬ 
tical  mixing  is  weak. 

A  density  ratio  of  two  is  typical  of  the  thermocline  be¬ 
neath  the  mixed  layer  [Schmitt,  1981].  This  phenomenon  is 
believed  to  be  caused  by  salt  fingering  [Schmitt,  1994]. 
What  is  remarkable  is  the  sharpness  of  the  transition  from 
R=\  to  R~2  across  the  mixed-layer  base,  especially  when 
the  mixed  layer  is  deep  [Ferrari  and  Rudnick,  2000;  Rud- 
nick  and  Martin,  2001].  The  change  in  density  ratio  from 
the  mixed  layer  to  the  thermocline  is  a  reflection  of  the 
different  mixing  processes  working  in  the  two  regions. 


Mixed-layer  vorticity 


Rapidly  rotating  vortices  in  the  ocean  arc,  in  theory, 
predominantly  cyclonic.  A  simple  explanation  invokes  the 
conservation  of  potential  vorticity  as  in  the  two- 
dimensional  Hoskins  and  Bretherton  semigeostrophic 
model  (Figure  4).  The  potential  vorticity  for  this  model  is 


Q 


dp  dv 

J^xTz 


(3) 


The  initial  state  of  the  model  is  one  of  zero  q  as  the  layer  is 
vertically  uniform  and  velocities  arc  zero.  As  the  front 
sharpens,  gradients  of  density  and  velocity  increase,  but  q 
must  remain  zero.  Because  of  thermal  wind,  the  second 
term  above  is  always  positive  (in  the  northern  hemisphere). 
To  conserve  q  then,  the  first  term  must  be  negative.  As 
static  stability  requires  dp/dr<0 ,  absolute  vorticity  must 

remain  positive.  If  the  relative  vorticity  is  to  be  the  magni¬ 
tude  of  /  or  larger,  it  must  thus  be  cyclonic. 

The  discussion  above  has  relevance  to  the  symmetric  in¬ 
stability  [Hoskins,  1974].  The  symmetric  instability  con¬ 
siders  an  exactly  circular  vortex,  so  a  two-dimensional 
model  (radial/vcrtical)  is  sufficient.  Instability  occurs  if  the 
potential  vorticity,  given  by  (3),  is  positive.  Hoskins  points 
out  that  a  statically  stable,  motionless  state  cannot  evolve 
frictionally  and  adiabatically  to  a  state  subject  to  symmet¬ 
ric  instability. 

The  relevant  length  scale  in  the  ocean  is  the  Rossby  ra¬ 
dius  of  deformation  for  the  mixed  layer,  of  order  10  km. 
The  impediment  to  making  accurate  measurements  of 
small-scale  vorticity  using  shipboard  ADCP  data  has  been 
navigation.  With  the  advent  of  military  grade  (P-codc) 
GPS,  this  problem  has  been  solved. 

Vorticity  is  estimated  using  data  of  3-km  horizontal 
resolution  from  a  cruise  in  the  North  Pacific  during  the 
winter  of  1997  [Rudnick,  2001].  Because  a  SeaSoar  was 
used,  underway  estimates  of  mixed-layer  depth  are  avail¬ 
able.  The  pdf  of  mixed-layer  relative  vorticity  normalized 
by  planetary  vorticity  has  a  mode  near  zero  and  is  posi¬ 
tively  skewed  (Figure  6).  The  skewness  is  apparent  as  a 
relatively  thick  tail  at  a  normalized  vorticity  of  0.5.  The 
skewness  of  vorticity  is  0.36,  and  is  considered  statistically 
significant.  The  skewed  vorticity  is  likely  the  velocity  sig¬ 
nature  of  spiral  eddies  seen  in  photographs  of  the  sea  sur¬ 
face  [Munk  ef  a!.,  2000].  The  contribution  of  these  eddies 
to  mixing  in  the  ocean  is  an  open  question. 
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Figure  6.  Probability  density  function  (pdf)  of  relative  vorticity 
normalized  by  planetary  vorticity  in  the  mixed  layer.  A  normal 
distribution  with  zero  mean  and  standard  deviation,  0. 1 8,  equal  to 
the  observations  is  shown  (solid  line).  The  skewness,  equal  to 
0.36,  is  apparent  as  the  relatively  high  value  of  the  pdf  near  a 
normalized  vorticity  of  0.5. 
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Dt  dt  dx  dy  oz 


Here,  y  is  the  along-front  direction,  z  is  vertical,  po  is  a 
constant  reference  density,  and  /is  a  constant  Coriolis  pa¬ 
rameter.  Ertel’s  potential  vorticity  (3)  is  conserved  follow¬ 
ing  fluid  particles 


^  =  0 
Dt 


(A6) 


Summary 

Shipboard  surveys  using  SeaSoar  and  ADCP  have 
greatly  improved  our  Icnowledge  of  small-scale  horizontal 
variability  in  the  upper  ocean.  Oceanic  fronts  are  observed 
to  be  in  approximate  thermal  wind  balance.  A  restratifying 
across-front  circulation  can  be  inferred,  implying  a  heat 
flux  of  order  1 0  W  m‘^  when  averaged  over  a  1 00  km“  re¬ 
gion  surrounding  the  front.  Temperature/salinity  compen¬ 
sation  is  typical  in  the  mixed  layer,  especially  when  the 
mixed  layer  is  deep.  The  observed  compensation  is  consis¬ 
tent  with  the  slumping  and  mixing  model  of  Young  and 
colleagues,  and  a  horizontal  diffusivity  that  is  a  growing 
function  of  density  gradient.  Mixed-layer  vorticity  is 
skewed  positive  because  the  largest  vorticities  are  positive. 
The  observed  skewness  is  consistent  with  potential  vortic¬ 
ity  conservation. 


Assume  a  solution  consisting  of  a  geostrophic  confluent 
flow  plus  a  two-dimensional  component. 


u ——ax-\‘u\x^zj') 

(A7) 

v-ay  +  v\x^zj^ 

(A8) 

(A9) 

p^p{x,z,t) 

(AlO) 

II 

(All) 

Note  that  the  parameter  a  may  be  a  function  of  time. 
Substitution  of  (A7-1 1)  into  (Al-5)  yields  the  equations 

—  +  aM=0  (A12) 

Dt 


Appendix 


du  dw 


(A13) 


The  following  is  a  review  of  the  Hoskins  and  Bretherton 
[1972]  two-dimensional  semi-geostrophic  model  of  fron- 
togenesis.  A  novelty  is  that  the  solution  is  in  fully  Lagran- 
gian  coordinates,  in  contrast  to  the  mixed  Eule- 
rian/Lagrangian  solution  in  Hoskins  and  Bretherton. 

The  semi-geostrophic  equations  appropriate  for  a  front 
in  the  :r-direction  are 

Po 


where 


_  g  dp 
dz  Po  dx 

M  =  v'  +  fx  . 


(AM) 


These  together  with  conservation  of  density  (A4)  are  the 
equations  we  need  to  solve. 


(Al) 
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The  prognostic  equations  (A  12)  and  (A4)  can  be  solved 
immediately  in  Lagrangian  coordinates,  with  no  explicit 
dependence  on  along-front  position, 

A/=A/o(xo,zJe'^  (A  15) 

p  =  p(x„,2o),  (A16) 


where 


0 


and  ^0  and  zq  are  the  initial  across-front  and  vertical  posi¬ 
tions  of  particles. 

A  useful  identity  expresses  the  Eulerian  spatial  gradient 
of  a  dependent  variable  h  in  Lagrangian  coordinates. 


(dh\ 

d{x,z) 

1 

o 

d{x„z,)_ 

(A  17) 


where  the  subscript  z  on  the  Ihs  makes  clear  that  z  is  held 
constant.  Using  (A  17),  (A13-14)  can  be  written  in  Lagran¬ 
gian  coordinates  as 


---/3 


(A  18) 

(A  19) 


The  solution  in  Lagrangian  coordinates  (A  15- 16)  is 
simple.  The  challenge  is  the  transformation  back  to  Eule¬ 
rian  coordinates  (A  18- 19).  These  equations  are  nonlinear, 
and  possibly  difficult  to  solve  depending  on  the  initial  and 
boundary  conditions  used. 

Potential  vorticity  may  be  written 

_l_^,p)^_l_^o,p) 

p,d{x,z)  Po5(x„,z„)’ 

and  is  necessarily  conserved,  as  can  be  readily  seen  by  its 
expression  in  Lagrangian  coordinates. 

Consider  a  layer  initially  vertically  uniform  in  density. 
Suppose  also  that  the  initial  horizontal  density  gradient  is 
so  small  that  along-front  velocity  and  its  vertical  shear  are 
negligible.  Using  these  initial  conditions,  solutions  are 

(A21) 

P  =  pM-  (A22) 


Substituting  these  into  the  thermal  wind  equation  (A  19) 
and  integrating  yields 


x  =  z 


(A23) 


where  is  to  be  determined  from  boundary  condi¬ 

tions.  Derivatives  of  x  with  respect  to  xo  and  zq  can  be  de¬ 
rived  from  (A23)  and  substituted  into  the  continuity  equa¬ 
tion  (A1 8).  Integrating  the  result  with  respect  to  Zq  gives 


_g 


~ Zq€  ^  +  5(xQ,r). 


(A24) 


We  use  boundary  conditions  that  particles  on  the  top 
and  bottom  boundaries  remain  there 


z{z^=±H)  =  ±H .  (A25) 


Evaluating  (A24)  at  the  boundaries,  and  using  (A23), 
yields  expressions  for  the  integration  constants 

A  =  x^e'^  (A26) 


2f-Po  dxl 

The  solutions  for  x  and  z  are  thus 


x  =  x.e  ^  +Z‘ 


g  dp 
fpo  ' 


(A27) 


(A28) 

(A29) 


The  solution  in  Figure  4  has  an  initial  density  profile  that  is 
an  arctan  with  a  length  scale  of  500  km,  in  a  layer  200  m 
deep. 

A  few  features  of  the  solution  are  worthy  of  note.  The 
absolute  vorticity  is 


dM 

dx 


1  +  z 


/Vo  ^^0 


(A30) 


Vorticity  becomes  infinite  in  a  finite  time,  provided  the 
initial  density  profile  has  nonzero  curvature.  This  will  hap¬ 
pen  first  on  the  boundaries. 

The  slope  of  an  isopycnal. 


if)  -  s  dp  g 
dz  I,  fPodx^ 


(A31) 


depends  only  on  xq  and  /,  so  the  isopycnal  is  a  straight  line. 
The  Richardson  number  is  given  by 


Ri  = 


(dv/dzy 


f 

dM/dx 


(A32) 


Therefore,  (A30)  implies  that  Ri  becomes  zero  in  a  finite 
time  in  the  region  of  strong  vorticity  and  subduction  at  the 
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front.  Ri  is  1  elsewhere.  Mixing  must  become  important  as 
the  front  sharpens. 
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Abstract.  The  surface  mixed  layer  of  the  ocean  is  often  characterized 
by  density  compensation  between  the  horizontal  temperature  and  salinity 
gradients.  In  this  contribution  we  present  a  combination  of  theoretical 
arguments  and  numerical  simulations  to  investigate  how  compensation  might 
emerge  as  a  result  of  processes  at  work  within  the  mixed  layer.  The  djmamics 
of  the  mixed  layer  are  investigated  through  a  simple  model.  The  model 
consists  of  a  pair  of  coupled  advection-diffusion  equations  for  heat  and  salt. 
The  coupling  arises  through  a  nonlinear  diffusion  operator  proportional  to  the 
buoyancy  gradient,  which  parameterizes  the  combined  effect  of  slumping  and 
mixing  of  small-scale  horizontal  buoyancy  gradients.  Nmnerical  solutions  of 
the  mixed  layer  model  show  that  the  nonlinear  diffusion  creates  compensation 
between  the  temperature  and  salinity  gradients,  while  the  stirring  field 
maintains  alignment  between  the  two  gradients.  The  results  of  this  work 
suggest  a  new  parameterization  of  the  horizontal  fluxes  of  heat  and  salt  for 
numerical  models  of  the  mixed  layer. 


1.  Introduction 

Observations  show  that  the  thermohaline  structure 
of  the  surface  mixed  layer  (ML)  of  the  ocean  is  largely 
compensated.  In  other  words,  temperature  and  salin¬ 
ity  fronts  coincide  so  that  the  resulting  density  contrasts 
are  small  relative  to  the  individual  contributions  of  heat 
and  salt.  This  phenomenon  has  been  known  for  some 
time  for  certain  fronts  at  scales  of  a  few  tens  to  one 
hundred  kilometers  {Roden,  1975;  Rudnick  and  Luyten, 
1996).  Recent  high-resolution  observations  have  shown 
that  compensation  exists  down  to  horizontal  scales  of 
tens  of  meters  in  the  North  Pacific  (Rudnick  and  Fer¬ 
rari,  1999;  Ferrari  and  Rudnick,  2000)  and  throughout 
the  global  ocean  on  scales  of  kilometers  {Rudnick  and 
Martin,  2001).  An  example  from  a  horizontal  tow  in  the 
ML  of  the  Subtropical  North  Pacific  is  given  in  Figure  1. 
Notice  how  almost  all  fluctuations  of  temperature  are 
mirrored  in  salinity  so  that  density  gradients  are  mini¬ 
mized. 

One  explanation  of  these  observations  is  that  atmo¬ 
spheric  forcing  conspires  to  create  and  juxtapose  water 
masses  with  compensating  properties.  However  the  ra¬ 
tio  of  heat  to  freshwater  density  fluxes  is  variable  in 
large  scale  maps  {Schmitt  et  al,  1989)  and  in  time  se¬ 
ries  at  a  point  {Weller  et  al.,  1985),  so  internal  ocean 


dynamics  is  required  to  accotmt  for  the  observed  com¬ 
pensation.  Young  (1994)  and  Ferrari  and  Young  (1997) 
propose  a  more  satisfactory  explanation  that  relies  on 
regulating  mechanisms  at  work  in  the  ML.  These  theo¬ 
retical  arguments  suggest  that  compensation  is  the  re¬ 
sult  of  the  preferential  diffusion  of  horizontal  density 
gradients  which  occurs  because  of  the  combined  action 
of  unbalanced  motions  and  vertical  mixing. 

The  physical  explanation  of  the  theory  of  Young  and 
collaborators  is  as  follows.  Horizontal  gradients  of  tem¬ 
perature  and  salinity  can  arise  in  the  ML  in  response 
to  non  homogeneous  atmospheric  forcing  and  entrain¬ 
ment  of  thermocline  waters.  At  some  locations  temper¬ 
ature  and  salinity  will  compensate  each  other  exactly, 
whereas  in  other  locations  temperature  and  salinity  will 
create  strong  horizontal  density  gradients.  Much  of  the 
ML  will  lie  between  these  two  extremes.  The  strong 
density  gradients  slump  under  the  action  of  gravity  and 
tend  to  restratify  the  ML.  Vertical  mixing  eventually  ar¬ 
rests  this  unbalanced  motion  by  remixing  the  ML.  This 
mechanism  is  essentially  thermohaline  shear  dispersion, 
where  the  shear  is  driven  by  the  horizontal  density  gra¬ 
dient,  and  the  vertical  mixing  results  from  the  variety 
of  processes  that  mix  the  ML.  On  the  other  hand,  com¬ 
pensated  fronts  are  balanced  and  therefore  do  not  expe¬ 
rience  shear  dispersion.  The  net  result  is  that  density 
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fronts  are  diflFused,  while  compensated  fronts  persist. 

The  preferential  diffusion  of  horizontal  density  gradi¬ 
ents  can  be  represented  with  mixing  parameterizations 
in  which  the  transport  of  heat  and  salt  depends  nonlin- 
early  on  the  density  gradient,  e.g.,  with  diffusivities  pro¬ 
portional  to  some  power  of  the  density  gradient  ( Young, 
1994;  Ferrari  and  Young,  1997).  In  this  paper  we  ex¬ 
amine  the  establishment  of  thermohaline  compensation 
by  implementing  these  nonlinear  diffusive  parameteri¬ 
zations  in  a  simple  model  of  the  ML.  Numerical  solu¬ 
tions  show  that  the  thermohaline  structure  of  the  ML 
is  generated  by  a  balance  between  the  mesoscale  strain¬ 
ing  field,  that  acts  to  increase  temperature  and  salinity 
gradients,  and  the  nonlinear  diffusion,  that  arrests  the 
formation  of  density  gradients  but  not  of  compensated 
gradients.  In  agreement  with  observations,  temperature 
and  salinity  gradients  tend  to  be  aligned,  because  both 
heat  and  salt  are  advected  by  the  same  straining  field, 
and  compensated. 

The  mechanism  of  compensation  described  above  im¬ 
plicate  vertically  sheared  currents  within  the  ML  and  it 
is  not  included  in  numerical  models  with  bulk  MLs.  In 
the  last  section  of  the  paper  we  show  how  to  simplify 
the  nonlinear  diffusive  parameterization  so  that  it  can 
be  implemented  in  ocean  circulation  models  to  improve 
the  representation  of  ML  thermohaline  dynamics. 

The  paper  is  organized  as  foUows.  In  section  2,  we 
revisit  the  arguments  of  Yoimg  and  collaborators  in  the 
context  of  the  parameterization  of  diapycnal  fluxes  in 
the  ML.  In  section  3,  we  describe  numerical  simulations 
used  to  test  the  nonlinear  diffusive  parameterization  of 
heat  and  salt  transports  in  the  ML.  In  section  4,  we 
suggest  a  simplified  version  of  the  nonlinear  diffusive 
parameterization  to  be  implemented  in  bulk  ML  mod¬ 
els.  Finally,  conclusions  are  offered  in  section  5. 

2.  Horizontal  transport  of  heat  and  salt 
in  the  mixed  layer 

Let  us  consider  the  dispersion  of  some  tracer  of  con¬ 
centration  6  in  the  ML.  We  model  the  ML  as  a  vig¬ 
orously  mixed,  shallow  layer,  characterized  by  a  small 
aspect  ratio,  i.e.,  with  a  depth  H  much  less  than  the 
horizontal  scale.  The  main  point  here  is  that  there  are 
two  very  different  time  scales:  a  fast  time  scale  ry  over 
which  the  layer  is  mixed  vertically  over  the  depth  H 
and  a  longer  time  scale  associated  with  horizontal 
transports. 

The  mathematical  model  for  the  transport  of  a  tracer 
6  stirred  by  an  incompressible  velocity  field  u  is  the 
familiar  advection-diffusion  equation, 

(1) 


together  with  appropriate  boundary  conditions.  The 
operator  on  the  RHS  represents  the  diffusion  of  tracer 
fluctuations  by  molecular  motions  and  k  is  the  molec¬ 
ular  diffusivity.  The  equation  in  (1)  is  appropriate  to 
describe  the  transport  of  6  at  scales  from  a  few  millime¬ 
ters  to  thousands  of  kilometers.  However,  the  resulting 
description  is  overly  complicated.  Our  goal  is  to  derive 
a  simpler  model  that  describes  transports  in  the  ML  at 
large  scales  and  long  times  by  averaging  the  equation 
in  (1)  over  short  times  and  short  scales.  The  key  step 
in  the  analysis  is  to  find  appropriate  scales  for  the  av¬ 
eraging  so  that  we  can  derive  a  closed  equation  for  the 
averaged  concentration  e  by  folding  all  the  details  of 
the  small  scale  motions  in  a  suitable  operator  V  that 
depends  on  averaged  variables,  i.e., 

dte^n-Ve^V{ix,6),  (2) 

where  u  is  the  averaged  velocity  field. 

Some  very  popular  ML  models,  referred  to  as  bulk 
models  (e.g.  Kraus  and  Turner,  1967),  choose  to  aver¬ 
age  over  the  depth  H  of  the  ML  and  the  characteristic 
time  of  vertical  mixing  ry ,  This  choice  is  quite  natural, 
because  the  turbulent  fluxes  that  homogenize  vertically 
the  ML  are  due  to  processes  such  as  convection  and 
Langmuir  cells,  characterized  by  coherent  eddies  which 
span  the  depth  H  and  have  an  aspect  ratio  close  to  one. 
In  these  models,  the  operator  V  parameterizes  all  the 
processes  that  maintain  the  ML  well  mixed  in  the  ver¬ 
tical.  A  problem  arises  when  bulk  ML  are  implemented 
in  circulation  models  that  resolve  horizontal  scales  that 
are  orders  of  magnitude  larger  than  H,  In  this  case 
one  has  to  average  the  tracer  equation  over  H  in  the 
vertical,  but  over  a  scale  L  >  H  in  the  horizontal.  A 
typical  solution  is  to  parameterize  in  series  the  motions 
on  scales  shorter  than  H  and  those  on  scales  between 
H  and  .  That  is,  the  same  operator  D  is  retained 
to  describe  the  fluxes  that  mix  vertically  the  ML,  but 
a  lateral  effective  eddy  diffusivity  is  introduced  to  pa¬ 
rameterize  the  fluxes  at  larger  scales.  Here  we  show 
that  unbalanced  horizontal  motions  with  characteristic 
scales  between  and  L  act  in  parallel  with  the  turbu¬ 
lent  motions  on  scales  shorter  than  H.  Therefore  it  is 
necessary  to  modify  the  operator  D  and  parameterize 
all  unresolved  motions  together. 

Let  us  average  equation  (1)  over  the  depth  H  in  the 
vertical,  over  a  scale  L>  H  in  the  horizontal  and  over 
a  time  th  >  ry.  The  scales  L  and  rn  have  only  lower 
bounds,  but  are  not  specified  for  the  moment.  We  ob¬ 
tain  the  Reynolds’  averaged  equation, 

-h  u  •  VhO  =  -Vh  •  ^  -h  KV%e  +  T.  (3) 

^See  Garrett  (2001)  for  a  discussion  of  parameterizations  of 
unresolved  motions  in  parallel  and  in  series. 


+  u .  =  KV^e, 
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Figure  1.  Potential  temperature  (red  line),  salinity  (blue  line)  and  potential  density  (black  line)  from  a  horizontal  SeaSoar 
tow  at  50  m  in  Subtropical  North  Pacific,  at  140  degrees  west,  between  28  and  30  degrees  north.  This  depth  is  in  the  middle 
of  the  local  mixed  layer.  The  vertical  axis  for  temperature  and  salinity  are  scaled  by  the  respective  expansion  coefficients  so 
that  excursions  of  temperature  and  salinity  show  the  change  they  imply  on  density. 


Here  6  and  u  are  the  averaged  velocity  and  averaged 
tracer  concentration  and  9^  and  u'  are  departures  from 
those  averages.  T  represents  the  flux  of  tracer  induced 
by  the  boundary  conditions  at  the  top  and  bottom  of 
the  ML-  The  notation  Vh  is  used  to  remind  that  deriva¬ 
tives  are  taken  only  in  the  horizontal,  because  the  av¬ 
eraged  quantities  do  not  depend  on  the  vertical  coor¬ 
dinate.  In  order  to  simplify  the  discussion,  we  assume 
that  H  is  a  constant  independent  of  position  (for  more 
on  this  point  see  Young ^  1994;  Garrett  and  Tandon, 
1997). 

The  next  step  is  to  express  the  first  term  on  the 
RHS  (called  “eddy  flux  divergence”)  in  terms  of  aver¬ 
aged  quantities.  Mixing-length  theories  are  a  common 
way  to  achieve  this  goal.  The  argument  goes  that  a 
fluid  particle  carries  the  value  of  a  conserved,  and  hence 
transferable,  tracer  for  some  length  1',  before  it  is  mixed 
with  its  new  surroundings.  We  give  a  vectorial  nature 
to  1'  to  allow  for  situations  which  are  not  isotropic.  If 
the  particle  has  a  concentration  of  scalar  typical  of  its 
surroundings  then  the  eddy  flux  of  tracer  9  is  given  by 

^  =  (4) 

where  it  is  assumed  that  varies  little  over  distances 
comparable  with  the  mixing  length  1'.  The  tensor  u'l' 
defines  the  eddy  diffusivity. 

In  the  special  case  when  the  statistics  of  the  velocity 
field  are  homogeneous  and  isotropic,  the  eddy  diffusivity 
tensor  is  a  constant,  and  the  eddy  transport  assumes  the 


form  of  a  down-gradient  Fickian  diffusion: 

(5) 

This  kind  of  closure  is  commonly  applied  to  ML  models 
and  the  two  relevant  scalars  (temperature  and  salinity) 
are  diffused  with  the  same  eddy  diffusion  coefficient, 
and  are  independent  from  each  other. 

However  in  the  ML  there  are  lateral  inhomogeneities 
in  the  buoyancy^  field  at  scales  larger  than  H.  Hor¬ 
izontal  buoyancy  gradients  slump  under  the  action  of 
gravity  and  drive  horizontal  eddy  fluxes.  Therefore  we 
expect  the  transport  of  tracer  to  be  in  the  direction  of 
and  to  increase  with  VjyH.  This  breaks  the  assump¬ 
tions  of  homogeneity  and  isotropy.  Therefore  a  down- 
gradient  Fickian  diffusion  cannot  be  used  to  model  the 
ML  at  scales  larger  then  H.  A  more  appropriate  ex¬ 
pression  for  the  diffusivity  tensor  is, 

^  =  (6) 

where  7  is  a  constant  and  /(IVh^I)  a  non-dimensional 
function  whose  form  depends  on  the  details  of  the 
hydrodynamic  instabilities  that  dominate  in  the  eddy 
field.  The  expression  (6)  is  rationalized  as  follows.  Ac¬ 
cording  to  mixing-length  theories,  the  diffusivity  tensor 

^Buoyancy  B  is  defined  as  p  =  po  [l  —  »  where  p  is 

the  fluid’s  density,  po  is  a  constant  reference  density  and  g  is  the 
acceleration  of  gravity.  With  this  definition,  B  has  the  dimensions 
of  acceleration. 
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can  be  expressed  in  terms  of  the  characteristic  velocity 
^eddy  and  length  leddy  of  the  transfer  process,  that  is 
u'l'  oc  Ucddy  ^eddy  In  OUT  case  the  length  scale  is  given 
by  leddy  =  UcddyTV,  where  ry  is  the  time  for  which 
the  slumping  process  acts,  before  it  is  arrested  by  the 
turbulent  fluxes  that  mix  vertically  the  ML.  The  eddy 
velocity  is  in  the  direction  of  with  a  magnitude 
proportional  to  \VhB\,  Thus  we  get  the  expression  in 
(6),  where  the  tensor  VhBVhB  arises  from  the  direc¬ 
tion  of  the  eddy  velocity  field  and  7/(|ViyB|)  is  a  pos¬ 
itive  semidefinite  term  that  determines  the  magnitude 
of  the  flux.  Notice  that  both  the  unbalanced  motions  at 
scales  larger  than  H  and  the  turbulent  fluxes  at  scales 
shorter  than  H  enter  in  the  closure  in  6.  That  is  the 
processes  of  slumping  and  mixing  act  in  parallel. 

Plugging  (6)  into  (4)  gives  the  eddy  tracer  flux, 

^  =  -jHI^hBI)  {VhB  .  VhB,  (7) 

Notice  that,  even  though  the  flux  is  in  the  direction  of 
V.B,  u'^'  •  V0  <  0.  Thus  the  flux  of  tracer  tends  to 
be  down  the  tracer  gradient,  but  only  the  projection  of 
the  tracer  gradient  along  the  direction  of  the  buoyancy 
gradient  contributes  to  the  flux. 

We  now  apply  the  closure  in  (7)  to  the  advection- 
diffusion  equations  for  heat  and  salt  in  the  ML, 

dtT  +  uVT  = 

=  7V-[/(|VS|)(VB-VT)VB]+JFr,  (8) 

dtS  +  uVS  = 

=  7V-[/(|VB|)(VS-V5)VS]+7-s,  (9) 

where  Tt  and  Fs  represent  the  thermohaline  fluxes 
from  the  top  and  bottom  of  the  ML.  We  dropped  over¬ 
bars  and  we  replaced  with  V  for  convenience,  but 
keep  in  mind  that  all  variables  are  averaged  over  scales 
larger  than  H  and  times  longer  than  ry  and  that  deriva¬ 
tives  are  taken  only  in  the  horizontal.  We  assume  a  lin¬ 
ear  equation  of  state  and  measure  T  and  S  in  buoyancy 
units  units,  so  that, 

B^T-S.  (10) 

The  nonlinear  advection-diffusion  equations  (8)  and  (9), 
together  with  (10),  form  a  closed  system  whose  solution 
is  fully  determined  once  the  forcings  Ft  and  Fs  and  the 
large  scale  velocity  field  u  are  prescribed. 

By  adding  and  subtracting  (8)  and  (9),  we  obtain 
closed  equations  for  buoyancy  and  spice  F  =  T  -f  5 
{Veronis,  1972;  Munk,  1981),  viz., 

dtB  +  uVB  = 

=  7V  •  [/{|VB|)(VB  •  VS)VB] 

dtV  +  uvv  = 

=  7V  •  [/(|VB|)  (vy  ■  VB)VB]  +  JV,(12) 


where  Fb  =  Ft  ■”  Fs  and  Fy  =  Ft  +  Fs^  We  can 
now  see  how  the  equations  in  (11)  and  (12)  model  the 
development  of  compensation  in  the  ML.  The  prod- 
uct  /(|VB|)(VjB  •  VB)  must  be  an  increasing  func¬ 
tion  of  I VB I  to  be  consistent  with  our  assumption  that 
eddy  fluxes  are  driven  by  buoyancy  gradients.  Under 
this  constraint,  the  nonlinear  diflFusion  always  dissipates 
buoyancy,  even  more  so  when  |  VB|  is  large.  Also  spice 
is  dissipated  where  |VB|  is  large.  However  large  val¬ 
ues  of  I VUI  can  survive  in  regions  where  |VB|  is  small. 
In  terms  of  temperature  and  salinity  this  means  that 
compensated  fronts,  for  which  VT  V5  persist,  while 
buoyancy  fronts  are  short  lived. 

Young  (1994)  and  Ferrari  and  Young  (1997)  derive 
formally  equations  of  the  form  of  those  in  (8)  and  (9) 
to  parameterize  the  transport  of  heat  and  salt  on  hori¬ 
zontal  scales  of  a  few  kilometers  in  the  ML.  These  theo¬ 
retical  works  are  examples  of  the  closures  we  have  been 
discussing  when  the  averaging  is  done  over  the  depth  of 
the  ML,  over  horizontal  scales  of  a  few  kilometers  and 
time  scales  of  a  few  hours.  Nonlinear  diffusion  arises 
because  the  horizontal  transport  of  heat  and  salt  is  by 
sheair  dispersion,  and  the  shear  flow  doing  the  disper¬ 
sion  is  driven  by  slumping  horizontal  buoyancy  gradi¬ 
ents.  The  strength  of  the  shear  dispersion  increases 
as  the  horizontal  buoyancy  gradient  squared,  that  is 
/(|VB|)  =  1  in  (8)  and  (9). 

At  scales  larger  than  the  Rossby  radius  of  deforma¬ 
tion  Bo,  unbalanced  motions  are  influenced  by  rotation 
in  the  form  of  baroclinic  instability.  Therefore,  if  one 
is  to  parameterize  the  transport  of  heat  and  salt  on 
horizontal  scales  larger  than  Bo,  say  10  km  for  a  typi¬ 
cal  ML,  the  closure  must  include  the  transports  due  to 
eddies  generated  at  barocHnically  unstable  gradients. 
Green  (1970)  and  Stone  (1972)  derived  expressions  for 
the  tracer  fluxes  generated  by  baroclinic  waves.  Their 
results  predict  that  the  baroclinic  eddy  fluxes  across 
a  buoyancy  gradient  are  proportional  to  the  absolute 
value  of  the  diapycnal  buoyancy  gradient.  Green  and 
Stone  considered  only  zonally-averaged  models  and  did 
not  investigate  the  direction  of  the  fluxes.  If  their  ar¬ 
guments  are  extended  to  two  horizontal  dimensions  to 
parameterize  diapycnal  fluxes  of  heat  and  salt  in  the 
ML,  one  obtains  nonlinear  diffusion  equations  of  the 
form  in  (8)  and  (9)  with  /(|VB|)  =  |VB|“^  Notice, 
however,  that  a  full  parameterization  of  baroclinic  in¬ 
stability  should  include  the  eddy  fluxes  along  isopycnals 
as  well  {Marshall  and  Shutts,  1981).  This  issue  is  not 
pursued  further  here,  because  we  focus  on  the  role  of  di¬ 
apycnal  fluxes  on  the  establishment  of  the  temperature- 
salinity  relationship  in  the  ML. 

Chris  Garrett,  during  the  meeting,  suggested  that 
symmetric  instability  might  also  drive  thermohaline 
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fluxes  in  the  ML.  Haine  and  Marshall  (1997)  used  nu¬ 
merical  simulations  to  study  what  hydrodynamical  in¬ 
stabilities  control  the  transfer  of  buoyancy  through  the 
ML  on  scales  of  some  tens  of  kilometers.  Their  conclu¬ 
sion  is  that  nonhydrostatic  baroclinic  instability  pro¬ 
vides  the  dominant  mode  of  lateral  buoyancy  transfer. 
However  symmetric  instability  plays  an  important  role 
during  the  slumping  process  by  setting  to  zero  potential 
vorticity  along  isopycnal  surfaces.  Clearly  more  work 
need  to  be  done  to  formulate  a  closure  that  takes  into 
account  the  effects  of  both  symmetric  and  baroclinic 
instabilities. 

3.  Thermohaline  alignment  and 
compensation  in  the  mixed  layer 

The  nonlinear  advection-diffusion  equations  in  (8) 
and  (9)  are  now  used  to  investigate  how  compensation 
appears  in  the  ML.  Suppose  that  spatial  variations  in 
temperature  and  salinity  are  created  by  surface  fluxes 
that  vary  on  large  horizontal  scales.  Mesoscale  stirring 
will  create  small-scale  temperature  and  salinity  gradi¬ 
ents  by  stretching  and  folding  the  large  scale  thermo¬ 
haline  patterns.  Large  density  gradients  will  disappear 
quickly  as  a  result  of  nonlinear  diffusion,  while  com¬ 
pensated  gradients  will  persist  for  longer  times.  Thus 
we  expect  that  the  temperature  and  salinity  gradients 
present  at  small  scales  at  any  particular  moment  will  be 
typically  compensated.  We  are  now  going  to  test  this 
scenario  with  a  numerical  model. 

3.1.  Numerical  model 

The  parameterization  in  (8)  and  (9)  is  tested  with  nu¬ 
merical  simulations  in  which  temperature  and  salinity 
are  advected  using  a  velocity  field  generated  by  solving 
the  equivalent  barotropic  equations  in  the  streamfunction- 
vorticity  formulation, . 

dtC  +  J{i>,0  =  -K  +  i'^^(:  +  ^(,  (13) 

where  ^  is  the  streamfunction,  (  =  the  relative 
vorticity  and  J  the  Jacobian  operator.  The  forcing  is 
applied  in  spectral  space  at  a  scale  of  6  km  with  constant 
amplitude  and  random  phases.  The  bottom  drag  coef¬ 
ficient  is  set  to  /i  —  3  •  10*“®  s*"^  and  the  hyper- viscosity 
to  1/  =  3  •  10®  m®  s”^.  The  result  is  a  two-dimensional 
turbulent  field  with  meandering  vortices  of  a  diameter 
of  approximately  3  km  (half  the  forcing  scale)  and  RMS 
velocities  of  0.1  m  s”^  (Figure  2).  The  domain  of  inte¬ 
gration  is  a  biperiodic  square  of  51.2  x  51.2  km^  with  a 
mesh  of  100  m.  This  is  a  poor  model  of  the  mesoscale 
dynamics  of  the  ML.  In  particular  we  are  neglecting 
feedbacks  between  the  buoyancy  and  the  velocity  fields. 
But  our  goal  is  to  show  that  compensation  develops  at 


Figure  2.  Snapshot  of  the  vorticity  field  obtained  by  in¬ 
tegrating  the  equivalent  barotropic  equations.  The  typical 
size  of  vortices  is  about  3  km,  that  is  half  the  wavelength  of 
6  km  at  which  the  vorticity  equation  is  forced. 


scales  small  scales,  regardless  of  the  details  of  the  stir¬ 
ring  field  and,  in  this  context,  the  model  in  13  suffices. 

The  temperature  and  salinity  equations  in  (8)  and 
(9)  are  integrated  with  /(|VB|)  =  1,  that  is  we  use  the 
closure  in  Young  (1994)  and  Ferrari  and  Young  (1997). 
The  value  of  7  is  set  to  10^^  m^s^  appropriate  for  typical 
ML  parameters  (details  in  Ferrari  and  Young^  1997). 
However  the  qualitative  results  discussed  in  the  rest  of 
this  paper  do  not  depend  on  the  particular  choice  of 

/(|VB|). 

Temperature  and  salinity  are  forced  with  orthogonal 
sinusoidal  patterns,  that  is  we  set  !Ft  =  Fq  cos  qx  in  the 
RHS  of  (8)  and  Ts  =  Fq  sin  qy  in  the  RHS  of  (9).  The 
sinusoids  have  a  wavelength  equal  to  the  domain  size, 
i.e.,  q  =  27r/51.2  km”^.  The  amplitude  Fq  is  chosen 
such  as  to  have  thermohaline  fluctuations  of  1°C  and 
0.35  psu,  at  the  scale  of  the  domain.  These  forcings  do 
not  to  impose  any  correlation  between  temperature  and 
salinity  fluctuations.  Further  details  on  the  numerical 
code  are  given  in  Ferrari  and  Paparella  (2001). 


3.2.  Complex  density  ratio 


It  is  common  to  quantify  compensation  in  terms  of 
the  density  ratio,  defined  as  the  change  in  buoyancy  due 
to  temperature  divided  by  the  change  in  buoyancy  due 
to  salinity. 


Rid  = 


(14) 


where  temperature  and  salinity  are  defined  in  buoyancy 
units,  and  £  is  the  direction  along  which  the  cut  is  taken. 
In  two-dimensions,  it  is  convenient  to  introduce  a 
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complex  density  ratio  as, 


R  =  '^X+iTy 

~  S^+iSy' 


(15) 


The  complex  density  ratio  has  both  a  magnitude  and  a 
phase,  R  =  |i?|exp(i0):  |ii|  is  the  ratio  of  the  magni¬ 
tudes  of  the  temperature  and  salinity  gradients  and  (f) 
is  the  angle  between  them.  If  the  gradients  are  parallel 
{<j>  =  0"^)  or  antiparallel  {<!>  =  180°),  there  is  thermoha¬ 
line  alignment  and  the  definition  of  R  in  the  complex 
plane  is  equivalent  to  that  in  (14)  regardless  of  the  ori¬ 
entation  of  i.  When  |i?|  >  1,  the  change  in  buoyancy 
due  to  temperature  is  greater  than  the  change  in  buoy¬ 
ancy  due  to  salinity  along  the  direction  of  VB,  that 
is  |VB  •  VT|  >  |VB  •  V5|  and  the  buoyancy-front  is 
temperature-dominated.  The  opposite  is  true  if  \R\  <  1 
and  the  buoyancy-fi:ont  is  salinity-dominated.  The  par¬ 
ticular  case  |i?|  =  1  and  <f>  =  0^  describes  thermohaline 
compensation. 

Because  the  magnitude  of  the  complex  density  ratio 
is  infinite  when  the  salinity  gradient  vanishes,  we  char¬ 
acterize  fronts  in  terms  of  the  phase  <f>  and  the  Turner 
angle, 

Tu  =  arctan  |ii|,  (16) 

choosing  the  branch  where  0  <  Tu  <  7r/2.  All  statistics 
will  be  computed  in  terms  of  (t>  and  Tu.  For  convenience, 
results  are  discussed  in  terms  of  <f>  and  |i?|,  because  their 
values  are  more  familiar. 

In  the  following  we  use  the  joint  pdf  ^(Tu,  (f))  to  de¬ 
scribe  the  degree  of  alignment  and  compensation  in  the 
ML.  The  joint  pdf  is  normalized  according  to 

r7r/2  p2-K 

/  dTu  /  d<^P(Tu,0)  =  l.  (17) 

Jo  Jo 


3.3.  Results  of  numerical  simulations 

For  the  simulations  we  use  kilometers  to  measure  dis¬ 
tances  and  hours  to  measure  time.  Therefore  vorticity 
is  given  in  h”^  and  buoyancy  in  km  h“^.  We  set  to  zero 
the  initial  vorticity,  temperature  and  salinity.  After  an 
initial  transient  of  several  eddy  turnover  times,  kinetic 
energy,  enstrophy,  temperature  and  salinity  variances 
settle  to  a  constant  value;  i.e,  the  system  reaches  an 
equilibrium  between  the  variance  input  by  the  forcing 
at  large  scales  and  dissipation  at  small  scales. 

In  Figures  3  and  4,  we  show  snapshots  of  spice  and 
buoyancy  700  h  after  the  beginning  of  the  simulation. 
It  is  difficult  to  recognize  in  these  snapshots  the  large 
scale  patterns  of  buoyancy  and  spice  imposed  by  the 
forcing  described  in  section  3.1.  But  the  sinusoidal  pat¬ 
terns  emerge  clearly  if  one  averages  the  fields  over  times 
of  the  order  of  a  few  himdred  hours.  At  small  scales  the 


two  fields  are  remarkably  different.  A  comparison  of  the 
black  contours  in  the  two  figures  shows  that  gradients 
of  spice  are  sharper  than  those  of  buoyancy:  buoyancy 
contours  are  evenly  spaced,  while  spice  contours  are  ex¬ 
tremely  packed  in  a  few  regions  and  widely  spaced  in 
others.  Sharp  spice  gradients  with  no  signature  in  buoy¬ 
ancy  imply  VT  «  V5,  i.e.,  thermohaline  compensation. 

The  small  scale  variability  in  Figures  3  and  4  is 
produced  by  stirring  the  large  scale  thermohaline  pat¬ 
terns.  The  temperature  and  salinity  gradients,  com¬ 
puted  across  the  grid  spacing  of  100  m,  are  typically 
aligned.  Alignment  occurs  because  the  isolines  of  T 
and  5  are  stretched  by  the  same  stirring  field  and  thus 
both  tracers  end  up  with  gradients  pointing  in  the  same 
directions  (ifua,  2001).  This  is  shown  through  the  joint 
pdf  P(Tu,  (j))  (Figure  5).  The  overwhelming  majority  of 
points  in  the  pdf  have  angles  very  close  to  either  (^  =  0° 
or  </>  =  180°.  But  this  is  not  the  whole  story,  because 
not  all  values  of  |i?|  are  equally  probable  along  those 
two  angles.  The  pdf  has  a  clear  peak  at  R  =  1.  This 
is  the  signature  of  nonlinear  diffusion  which  selectively 
dissipates  all  gradients  whose  density  ratio  is  different 
from  one  and  establishes  compensation.  Stirring  alone 
does  not  produce  a  single  peak  in  the  pdf,  because  it 
acts  only  on  the  relative  orientation  of  VT  and  V5  but 
not  on  the  ratio  of  their  magnitudes.  This  was  checked 
by  running  a  simulation  in  which  the  nonlinear  diffusion 
was  set  to  zero.  In  this  limit,  the  pdf  P(Tu,  4)  is  indeed 
collapsed  along  the  angles  0  =  0°  and  0  =  180°,  but  it 
does  not  have  a  single  mode. 

Compensation  is  not  maintained  always  and  every¬ 
where  in  the  domain.  There  are  regions,  in  Figures  3 
and  4,  where  buoyancy  and  spice  gradients  are  compa¬ 
rable.  This  happens  when  the  stirring  field  momentarily 
creates  large  buoyancy  gradients  at  small  scales.  These 
gradients  do  not  persist  for  long,  though,  because  non¬ 
linear  diffusion  restores  compensation  in  a  few  hours. 
At  any  time,  a  one  dimensional  cut  through  the  do¬ 
main  shows  many  compensated  fronts  and  some  rare 
buoyancy  fronts.  This  result  agrees  with  the  thermo¬ 
haline  structure  found  by  Ferrari  and  Rudnick  (2000) 
in  the  ML  of  the  Subtropical  North  Pacific  (Figure  1), 
where  almost  all  temperature  and  salinity  fluctuations 
are  compensated. 

4.  Implications  for  numerical  models  of 
the  mixed  layer 

In  the  previous  sections  we  have  suggested  that  the 
thermohaline  compensation  observed  in  the  ML  is  con¬ 
sistent  with  preferential  diffusion  of  horizontal  buoy¬ 
ancy  gradients.  The  theoretical  argument  implicates 
vertically  sheared  currents  within  the  ML  as  the  agent 
which  produces  the  preferential  horizontal  transport  of 
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Figure  3.  Snapshots  of  spice  at  the  same  time  of  figures  2 
and  4.  The  colored  pattern  hints  at  the  large-scale  sinu¬ 
soidal  checkerboard,  imposed  on  the  spice  field  through  the 
thermohaline  forcing. 
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Figure  4.  Snapshots  of  buoyancy  at  the  same  time  of 
figures  2  and  3.  The  colored  pattern  hints  at  the  large- 
scale  sinusoidal  checkerboard,  imposed  on  the  buoyancy  field 
through  the  thermohaline  forcing.  At  small  scales  buoy¬ 
ancy  shows  less  structure  and  milder  gradients  than  spice. 
Regions  with  large  spice  fluctuations  with  no  signgiture  in 
buoyancy  are  the  trademark  of  compensation. 


Figure  5.  Joint  pdf  P(Tu,  of  the  complex  density  ratio  R 
defined  in  section  3.2.  The  azimuthal  position  in  the  radar 
plot  indicates  the  angle  between  the  temperature  and  the 
salinity  gradients,  while  the  radial  displacement  is  the  ratio 
of  their  magnitudes.  The  pdf  is  an  average  over  200  hours 
during  which  the  simulation  was  in  equilibrium.  Nearly  all 
points  lie  along  the  line  of  alignment,  that  is  at  angles  — 
and  <!>  =  180°.  The  maximiun  of  the  pdf  is  at  R  =  1  and 
shows  that  thermohaline  fronts  are  typically  compensated. 

density.  Numerical  models  with  bulk  MLs  do  not  in¬ 
clude  this  physics.  The  purpose  of  this  last  section  is 
to  suggest  a  simple  parameterization  of  ML  horizon¬ 
tal  transports  that  might  improve  the  representation  of 
thermohaline  dynamics  in  numerical  models  with  bulk 
ML. 

Bulk  ML  parameterizations  ignore  the  potential  en¬ 
ergy  stored  in  horizontal  buoyancy  gradients.  But  we 
contend  that  the  release  of  this  potential  energy  plays 
an  important  role  in  establishing  compensation.  The 
system  in  (11)  and  (12)  describes  the  horizontal  dynam¬ 
ics  of  the  vertically-averaged  fields  and  could  be  imple¬ 
mented  in  a  bulk  ML  model.  However  the  nonlinear 
diffusive  terms  on  the  RHS  of  (11)  and  (12)  are  difficult 
to  integrate  numerically.  In  small  regions  with  large 
buoyancy  gradients  the  diffusive  constraint  on  the  time 
stepping  becomes  severe  {Ferrari  and  Paparella,  2001) 
and  the  whole  calculation  proceeds  very  slowly.  The 
path  we  follow  here  is  to  derive  a  substitute  model  that 
retains  the  basic  physics  of  (11)  and  (12),  but  that  is 
easy  to  integrate  numerically.  That  is  we  write  a  linear 
model  that  diffuses  horizontal  buoyancy  gradients  more 
efficiently  than  spice  gradients  in  the  following  way, 

= 


+  u  •  VB 
Vi  +  u  ‘  VF 


(18) 

(19) 
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By  setting  kb  ^  buoyancy  gradients  decay  faster 
than  spice  gradients.  The  main  advantage  of  the  model 
in  (18)  and  (19)  over  that  in  (11)  and  (12)  is  that  the 
diffusivities  are  independent  of  the  buoyancy  gradients 
and  therefore  the  diffusive  constraints  on  the  time  step¬ 
ping  do  not  grow  unbounded. 

In  order  to  match  observations,  the  two  diffusivi¬ 
ties  Kb  and  Ky  must  be  chosen  such  that  compensa¬ 
tion  happens  mostly  at  scales  below  10  km  {Ferrari  and 
Rudnick^  2000).  That  is  the  dissipation  cutoff  scale  for 
buoyancy  must  be  of  the  order  of  10  km,  while  the  dis¬ 
sipation  cutoff  scale  for  spice  must  be  smaller.  The 
dissipation  cutoff  scale  for  buoyancy  can  be  estimated 
as. 


where  a  is  the  RMS  strain  rate  of  to  the  mesoscale  eddy 
field  u.  A  reasonable  strain  rate  in  the  ML  is  of  the  or¬ 
der  of  10"“®  s“^.  By  imposing  Ldus  ^  10  km,  it  follows 
that  Kb  «  10®  m^  s“^.  The  choice  of  Ky  is  somewhat 
arbitrary,  but  it  should  be  a  couple  of  orders  of  magni¬ 
tude  smaller  than  /cbj  so  that  there  is  at  least  a  decade 
between  the  cutoff  scales  of  spice  and  buoyancy. 

The  final  step  is  to  write  the  linear  model  in  (18)  and 
(19)  in  terms  of  temperature  and  salinity,  by  using  once 
more  the  linear  expressions  for  buoyancy  JB  =  T— 5  and 
spice  F  =  r  -H  5, 

Tt  +  u-VT  =  (21) 

5t+u-V5  =  - /c_ V^T,  (22) 

where  «;+  =  {kb  +  K.y)/2  and  =  (kb  —  Ky)/2.  The 
coupling  between  the  salt  and  heat  fluxes  in  (21)  and 
(22)  is  formally  similar  to  the  Soret  and  DuFour  effects 
that  operate  on  a  molecular  level  (Caldwell,  1973).  The 
main  difference  is  that,  in  the  present  case,  all  terms 
in  the  RHS  of  (21)  and  (22)  are  of  the  same  order  and 
none  can  be  neglected.  Only  for  kb  —  the  coupling 
between  temperature  and  salinity  disappears. 

The  parameterization  in  (21)  and  (22)  has  been 
tested  by  integrating  the  equations  with  the  velocity 
field  and  thermohaline  forcings  described  in  section  3.1. 
The  lower  panel  of  Figure  6  shows  the  pdf  of  the  merid¬ 
ional  density  ratio  at  a  scale  of  3  km  obtained  with  the 
linear  model.  The  pdf  has  a  clear  peak  at  iZ  =  1,  as 
in  the  observations  (upper  panel  of  Figure  6).  Notice 
that  the  large  scale  density  ratio  for  the  same  simula¬ 
tion  is  uniform.  Thus  compensation  is  a  result  of  the 
parameterization  in  (21)  and  (22),  and  is  not  due  to  the 
external  forcing. 


R 

-1  0  1  2  oo 


-1  0  1  2  oo 


Tu=atan(R) 

Figure  6.  Probability  density  function  of  the  horizontal 
mixed-layer  Turner  angle  across  a  distance  of  3  km  along 
25-35  degrees  in  the  North  Pacific  (upper  panel)  and  across 
the  same  distance  in  a  simulation  with  the  proposed  mixed 
layer  parameterization  (lower  panel).  The  values  of  Turner 
angle  and  density  ratio  are  indicated  on  the  upper  and  lower 
axes.  The  pdfs  have  a  peak  close  to  R  =  1  and  represent 
thermohaline  fields  characterized  by  buoyancy  compensa¬ 
tion. 

5.  Conclusions 

We  have  shown  that  the  ubiquitous  compensation 
of  thermohaline  gradients  observed  in  the  ML  is  consis¬ 
tent  with  the  theoretical  arguments  of  Young  (1994)  and 
Ferrari  and  Young  (1997).  Compensation  can  be  ex¬ 
plained  as  the  preferential  diffusion  of  horizontal  buoy¬ 
ancy  gradients  which  occurs  because  unbalanced  motion 
due  to  these  gradients  is  stronger  in  the  ML  than  in  the 
more  nearly  geostrophic  interior.  The  horizontal  pres¬ 
sure  gradients  associated  with  the  buoyancy  gradients 
produce  “exchange  flows”  which  act  to  restratify  the 
ML  in  the  vertical.  The  turbulent  fluxes,  that  contin¬ 
uously  mix  the  ML  in  the  vertical,  oppose  the  restrati¬ 
fication  and  weaken  the  horizontal  buoyancy  gradients. 
This  process  is  essentially  shear  dispersion  of  buoyancy, 
where  the  shear  flow  is  driven  by  the  density  gradients 
themselves. 

The  theoretical  arguments  of  Young  and  collabora¬ 
tors  implicate  that  eddy  fluxes  of  heat  and  salt  in  the 
ML  are  in  the  direction  of  the  buoyancy  gradients  and 
act  to  weaken  the  horizontal  buoyancy  stratification. 
That  is,  the  thermohaline  diapycnal  fluxes  remove  the 
energy  stored  in  horizontal  buoyancy  gradients.  The  sit- 
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uation  is  reversed  below  the  ML  base,  where  the  avail¬ 
able  potential  energy  is  believed  to  be  removed  by  adi¬ 
abatic  processes.  The  difference  in  the  two  cases  is  that 
there  are  strong  diabatic  motions  in  the  ML  at  small 
scales,  while  flows  are  mostly  along  isopycnals  in  the 
ocean  interior.  Expressions  like  the  one  in  (7)  might  be 
a  first  step  toward  a  better  parameterizations  of  diapy- 
cnal  fluxes  in  the  ML. 

We  implemented  the  nonlinear  diffusive  parameteri¬ 
zation  of  heat  and  salt  in  a  simple,  idealized  model  of 
the  ML.  Numerical  simulations  showed  that  buoyancy- 
driven  diffusion  is  complemented  and  locally  enhanced 
by  the  mesoscale  stirring  field.  The  strain  in  the  ve¬ 
locity  field  continuously  creates  thermohaline  gradients 
at  small  scales.  Nonlinear  diffusion  selectively  removes 
buoyancy  gradients.  As  a  result  at  any  single  time  typi¬ 
cal  temperature  and  salinity  gradients  are  compensated, 
in  agreement  with  the  observations  of  Ferrari  and  Rud- 
nick  (2000).  Our  model  does  not  include  any  feedback 
between  the  velocity  field  and  buoyancy,  as  due  to  ro¬ 
tation.  An  obvious  direction  for  future  research  is  to 
investigate  whether  this  feedback  plays  an  important 
role  in  the  establishment  of  the  temperature-salinity  re¬ 
lationship  of  the  ML. 

Our  discussion  emphasized  the  role  of  diapycnal 
fluxes  in  removing  horizontal  buoyancy  gradients.  A 
question  arises  as  to  what  maintains  the  long-lived 
buoyancy  fronts  observed  at  some  locations  in  the  ML 
(e.g.  Roden,  1975;  Rudnick  and  Luyten,  1996).  First, 
these  fronts  have  horizontal  scales  of  a  few  tens  of  kilo¬ 
meters.  The  buoyancy-driven  fluxes,  discussed  above, 
are  mostly  active  at  scales  shorter  than,  say,  10  kilome¬ 
ters  and  are  weaker  at  larger  scales.  Second,  ML  fronts 
are  believed  to  be  maintained  by  smrface  forcing  or  by 
convergences  in  the  velocity  field.  When  this  is  the  case, 
the  diapycnal  thermohaline  fluxes  do  not  remove  com¬ 
pletely  the  buoyancy  gradient.  Instead  an  equilibrium 
is  reached  between  nonlinear  diffusion  and  forcing.  The 
result  is  a  buoyancy  front  in  which  temperature  and 
salinity  partly  oppose  each  other,  but  do  not  compen¬ 
sate.  Observations  indeed  show  partial  cancellation  of 
the  thermohaline  components  at  many  ML  fronts. 

We  also  derived  a  linear  model  that  reproduces  the 
preferential  diffusion  of  horizontal  buoyancy  gradients. 
The  hnear  parameterization  produced  realistic  distribu¬ 
tions  of  the  density  ratio  in  a  simple,  idealized  model 
of  the  ML.  It  remains  an  open  question  whether  the 
new  parameterization  would  produce  realistic  diapyc¬ 
nal  fluxes  and  water  mass  conversions,  if  implemented 
in  high-resolution  ocean  models  with  bulk  ML. 
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Water  mass  transformation  due  to  mixed  layer 
entrainment  and  mesoscale  stirring;  In  series  or 
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Abstract.  The  convergence  of  advective  and  diflFusive  buoyancy  flux  must 
match  the  air-sea  buoyancy  flux  between  two  outcropping  isopycnals.  This 
leads  to  a  diagnostic  framework  for  water  mass  transformation  in  which  a 
myriad  of  difierent  processes  can  be  incorporated  imder  a  unif^ng  balance.  We 
review  how  the  diapycnal  advection  due  to  ubiquitous  mixed  layer  entrainment 
can  be  included  in  this  framework,  and  we  estimate  its  contribution  to  the 
large  scale  transformation.  We  also  consider  how  decomposing  the  flow  and 
buoyancy  fleld  into  mean,  eddy  and  turbulent  parts  leads  to  clarifying  the 
interaction  of  mixed  layer  and  mesoscale  (or  sub-mesoscale)  eddies  in  the 
overall  large  scale  balance. 


1.  Introduction 

Quantifying  the  water  mass  changes  and  finding  the 
water  mass  formation  rates  for  specific  density  classes 
are  central  to  understanding  the  diabatic  ocean  circula¬ 
tion.  Accurate  quantification  of  water  mass  formation 
rates  is  also  important  for  understanding  variability  in 
oceanic  heat  and  freshwater  fluxes  and  their  contribu¬ 
tion  to  climate. 

The  cvurent  description  of  water  mass  transforma¬ 
tion  was  formulated  by  Wolin  (1982),  who  proposed 
that  the  poleward  surface  drift  in  the  ocean  can  be  de¬ 
termined  directly  by  the  air-sea  heat  flux  —Bq  at  the 
surface.^  This  led  Walin  to  suggest  the  relation 

=  (1) 

where  F  is  the  cross-isotherm  advection  (volumetric 
flow  rate),  and  <55  is  the  area  enclosed  between  the  two 
isotherm  outcrops  as  shown  in  Figure  1. 

This  relationship  was  subsequently  successfully  used 
in  diagnostic  calculations  {Tziperman  1986,  Speer  and 
Tziperman  1992).  These  calculations  showed  close  cor¬ 
respondence  between  the  water  mass  formation  derived 
by  the  diathermal  advection  from  the  Walin  relation 
and  the  estimated  rates  of  water  mass  formation  for 


^The  interest  in  water  mass  formation  rate  in  specific  density 
classes  has  followed  various  diagnostics,  e.g.  in  Walin  (1982) 
arising  out  of  considering  budgets  with  heat  fluxes  and  isotherms, 
and  in  other  studies,  the  distribution  of  isopycnals  and  buoyancy 
fluxes.  Thus  we  retain  the  symbol  Bo  for  both  surface  heat  fluxes 
and  surface  buoyancy  fluxes. 


Figure  !•  A  schematic  of  the  diapycnal  advection  F(T) 
considered  by  Walin  (1982).  The  net  surface  heat  flux 
is  balanced  by  a  net  advective  flux  at  the  surface. 


Sub- Tropical  mode  water  and  the  Sub-Polar  Mode  Wa¬ 
ter  from  observations.  However,  the  Walin  (1982)  for¬ 
mulation  assumed  negligible  mixing  in  the  upper  ocean. 

In  the  same  year  that  Walin  proposed  his  ideas  about 
this  diathermal  advection,  Niiler  and  Stevenson  (1982) 
sought  to  constrain  the  values  of  diapycnal  mixing  by 
considering  the  heat  budget  of  the  isotherms  that  out¬ 
crop  in  the  tropics  and  are  subject  to  net  heat  gain 
through  the  year  as  shown  in  Figure  2. 

In  this  case,  since  by  definition  the  net  change  in  the 
mean  annual  isothermal  position  is  zero,  there  cannot 
be  a  net  diathermal  advection  and  the  net  heat  flux 
must  be  balanced  by  a  net  diffusive  flux  due  to  mix¬ 
ing.  Therefore,  at  least  in  the  limiting  case  of  closed 
mean  isopycnal  surfaces,  the  role  of  diffusive  flux  due 
to  mixing  cannot  be  ignored,  and  the  surface  drift  of 
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Figure  2.  A  schematic  of  the  outcropping  tropical  an¬ 
nual  mean  isotherms  considered  by  Niiler  and  Steven¬ 
son  (1982).  The  net  surface  heat  flux  must  be  balanced 
by  a  net  diffusive  flux  due  to  mixing. 


Figure  4.  Garrett  et  al.  (1995)’s  ideal  case.  With  no 
horizontal  mixing  and  only  vertical  mixing,  D{b)  can 
be  ignored  and  the  diapynal  advection  equals  the  air- 
sea  transformation,  or,  A  =  F. 


Bo 


Figure  3.  Air-sea  buoyancy  flux  Bq  SS  between  two 
outcropping  isopycnals  b  and  b-\-  Sb  must  be  balanced 
by  convergence  of  diapycnaJ  advective  flux  due  to  ad¬ 
vection  A{b)  and  diffusive  flux  D{b).  Prom  Garrett  et 
al.  (1995). 


Walin  (1982)  cannot  achieve  this  balance. 

This  realization  led  Garrett  et  al.  (1995)  to  clarify 
the  Walin  formulation.  They  insisted  that  considering 
the  diapycnal  advection  about  any  averaged  isopycnal 
position  would  require  diapycnal  mixing,  since  other¬ 
wise  the  isopycnal  would  simply  be  advected  around  by 
the  flow.  As  Figure  3  shows,  the  convergence  of  the  di¬ 
apycnal  advective  and  diffusive  fluxes  must  match  the 
air-sea  buoyancy  flux  between  two  isopycnals. 

The  Walin  (1982)  approach  was  thus  generalized  by 


Garrett  et  al.  (1995)  as 

A{b)  =  F{b)-^  (2) 

where  D{b)  is  the  diapycnal  diflFusive  flux  across  the 
isopycnal  6.  Formal  derivations  of  this  relation  appear 
in  GarreU  et  al  (1995)  and  in  Marshall  et  al  (1999). 

D{h)  consists  of  many  mixing  processes  that  can  re¬ 
sult  in  diffusive  flux  across  isopycnal  6,  There  will  be 
contributions  to  D  from  diffusive  mixing  in  the  thermo- 
cline,  while  horizontal  diffusive  fluxes  due  to  mesoscale 
stirring  would  arise  near  the  ocean  surface,  and  tidal 
mixing  near  rough  bathymetry  for  the  isopycnals  that 
intersect  with  the  ocean  bottom  in  such  regions.  In 
so  far  as  the  balance  in  the  upper  ocean  is  concerned, 
if  the  control  volume  is  chosen  to  be  boxmded  by  two 
isopycnals  h  euid  6  -f  56  and  a  control  surface  at  the  bot¬ 
tom  of  the  mixed  layer  (Figure  4),  Garrett  et  al  (1995) 
showed  that  for  negligible  horizontal  mixing  and  with 
vertical  isopycnals,  vertical  mixing  J3(6)  makes  no  con¬ 
tribution  and  the  net  diapycnal  advection  is  only  due 
to  the  air-sea  transformation,  that  is, 

yl(6)  =  F(6)  (3) 

where  F{b)  =  is  the  air-sea  transformation, 

with  the  negative  sign  due  to  the  convention  that  the 
diapycnal  advection  from  less  buoyant  to  more  buoyant 
water  mass  is  considered  positive  (reversing  the  signs  of 
Walin ’s  convention  in  Figure  1).  This  is  an  important 
idealization,  since  in  this  case  the  diapycnal  advection 
can  be  fully  diagnosed  by  surface  buoyancy  flux  and 
surface  buoyancy  outcrop  distribution. 

Speer  (1997)  evaluated  F{b)  and  A{b)  using  a  hy¬ 
drographic  section  at  11°S,  as  shown  in  the  schematic 
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Figure  5,  A  schematic  showing  the  control  volume  for 
the  diagnostic  calculation  carried  out  by  Speer  (1997). 


Figure  6.  The  ideal  algorithm  can  no  longer  be  ex¬ 
pected  to  be  valid  if  time-dependent  effects  are  taken 
into  account  with  entrainment  at  the  mixed  layer  base. 


Figure  5.  These  diagnostic  calculations  showed  that 
the  residual  between  A(6)  and  F(b)  is  significant,  and 
therefore  transformation  due  to  the  diffusive  flux  term 
dD{h)/dh  must  be  considered.  While  this  diagnostic 
calculation  cannot  be  used  to  pinpoint  which  processes 
contribute  to  the  D  term,  Speer  (1997)  showed  that  the 
calculations  are  consistent  with  diapycnal  mixing  in  the 
thermocline. 

The  formulation  considered  thus  far  has  assumed 
steady  state,  but  an  accumulation  within  the  isopyc- 
nals  can  be  interpreted  as  a  net  inflow,  and  (2)  can  still 
be  used.  A  steady  state  scenario  neglects  the  role  of  di¬ 
apycnal  advection  due  to  time  dependent  deepening  and 
shallowing  of  the  mixed  layer,  at  both  seasonal  and  di¬ 
urnal  timescales.  Conceptual  modifications  are  required 
to  relax  the  steady  state  assumption  for  the  water  mass 
transformation  balance  as  shown  in  Figure  6. 

The  next  section  considers  diapycnal  advection  due 
to  the  ubiquitous  process  of  mixed  layer  deepening  and 
entrainment.  Upwelling  at  the  base  of  the  mixed  layer 


(a)  (b) 


Figure  7.  A  schematic  showing  isopycnal  distribution 
(left)  and  buoyancy  distribution  (right).  The  solid  lines 
are  before  the  mixing  event  and  the  dashed  lines  are 
after  the  mixing  event. 


and  subsequent  mixing  results  in  water  mass  transfor¬ 
mation  which  is  conceptually  very  similar  to  that  due  to 
mixed  layer  deepening  and  entrainment  at  the  base  of 
the  mixed  layer.  Regions  of  significant  upwelling  such  as 
those  foimd  near  submesoscale  instabilities  near  fronts 
are  considered  next  via  an  example  from  FASINEX 
{Weller  1991).  The  final  section  considers  whether  the 
stirring  due  to  mesoscale  processes  should  be  considered 
as  an  independent  process  for  water  mass  formation  and 
hence  considered  to  be  independent  of  the  transforma¬ 
tion  due  to  the  mixed  layer  entrainment,  or,  do  the  two 
processes  happen  together,  in  series  and  are  interdepen¬ 
dent.  The  water  mass  transformation  rates  would  only 
be  additive  in  the  former  case.  We  conclude  with  a 
summary  and  unanswered  questions. 


2.  Transformation  Due  to  Mixed  Layer 
Deepening 


Garrett  and  Tandon  (1997)  show  that  buoyancy  re¬ 
distribution  in  the  vertical  by  mixing  achieves  diapycnal 
advection  across  an  isopycnal  surface.  Figure  7  shows 
an  isopycnal  that  outcrops  at  position  xq  due  to  vertical 
mixing,  and  the  buoyancy  balance  is  shown  on  the  right. 
X  is  taken  to  be  the  direction  normal  to  the  outcrop 
towards  less  dense  waters. 

With  a  uniform  buoyancy  gradient  in  the  vertical  and 
in  the  horizontal  everywhere,  the  volume  between  the 
successive  isopycnais  does  not  change,  i.e,  there  is  no 
diapycnal  advection.  However,  a  departure  from  these 
conditions  implies  a  net  diapycnal  advection.  In  par¬ 
ticular,  if  a  pre-existing  mixed  layer  of  depth  —ft  -f-  dh 
deepens  to  —ft,  as  shown  in  Figure  8,  Garrett  and  Tan¬ 
don  (1997)  show  that  the  net  diapycnal  advection  Fml 
is  given  by 


Fml 


(«;  +  1^)  dt  (4) 
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(a)  Isopycnal  position 


(b)  Buoyancy  profiles  at  x  =  xq 


bob^ 


-h  +  6h - 1 - 

-h . 


bo-b,  =-Ab^ 


Figure  8.  A  schematic  showing  isopycnal  distribu¬ 
tion  (left)  and  buoyancy  distribution  (right)  for  a  pre¬ 
existing  mixed  layer  in  presence  of  both  bxx  and 
The  solid  lines  are  before  the  mixing  event  and  the 
dashed  lines  are  after  the  mixing  event. 


per  unit  outcrop  length,  the  subscripts  i  and  s  refer 
to  the  gradients  below  the  mixed  layer  (interior)  and 
surface  mixed  layer  respectively,  Ab  is  the  buoyancy 
change  at  the  mixed  layer  base  before  a  deepening  event, 
h  is  the  mixed  layer  depth,  w  is  the  vertical  velocity  at 
the  mixed  layer  base,  bxx^bx  are  buoyancy  derivatives 
calculated  at  the  surface,  t  is  the  time  and  T  =  1  yr. 
Fml  is  the  integrated  diapycnal  advection  over  the  year 
for  each  deepening  event  w+^  >0,  since  the  upwelling 
with  vertical  velocity  w  has  an  effect  similar  to  mixing 
and  subsequent  deepening  by  u;  St. 

The  Fml  expression  above  consists  of  two  terms,  the 
first  depending  on  vertical  variation  of  horizontal  buoy¬ 
ancy  gradients  and  the  second  depending  on  the  cur¬ 
vature  or  the  horizontal  variation  of  the  surface  buoy¬ 
ancy  gradients.  Garrett  and  Tandon  (1997)  remark  on 
several  cases  in  the  limit  as  bxi,  the  buoyancy  gradi¬ 
ent  at  the  mixed  layer  interface  and  bxs,  the  buoy- 
ancy  gradient  at  the  surface  vanish,  showing  that  the 
singularity  in  (4)  is  no  worse  than  the  singularity  in 
-^air— sea  =  Bo{dS/db).The  formulae  need  information 
on  both  large-scale  horizontal  gradients  and  vertical  cy¬ 
cling  of  the  mixed  layer. 

Tandon  and  Zahariev  (2001)  have  used  the  Marine 
Light  Mixed  Layer  (MLML)  experiment  mooring  obser¬ 
vations  {Plueddemann  et  al.  1995)  which  include  both 
spring  and  fall  mixed  layer  transitions  and  used  a  com¬ 
bination  of  mixed  layer  model  results  and  hydrography 
{daSilva  et  al  1994)  to  calculate  the  bxx  terms  to  get 
an  order  of  magnitude  estimate  of  Fml^  The  sensitivity 
to  averaging  of  synoptic  events  is  also  explored.  Their 
calculations  indicate  that  if  hourly  winds  are  used,  the 
water  mass  transformation  due  to  mixed  layer  entrain¬ 
ment  has  annual  peak  contributions  of  about  0(4)  Sv 
for  ct  —  24.0.  This  is  comparable  to  the  annual  trans¬ 
formation  attained  by  diapycnal  mixing  in  the  upper 
ocean  water  masses  by  Zhang  and  Talley  (1998).  How¬ 
ever,  with  daily  averaged  winds  and  without  diurnal 


Figure  9.  With  meanders  superposed  on  a  mean  isopy¬ 
cnal  distribution,  Fml  estimates  are  higher,  both  due 
to  the  variation  in  buoyancy  gradient  and  the  increased 
length  of  the  isopycnal  over  which  water  masses  get 
transformed. 

variation  in  buoyancy  forcing,  this  contribution  is  up  to 
an  order  of  magnitude  smaller.  Another  set  of  mixed 
layer  simulations  includes  an  annual  cycle  with  a  shal¬ 
low  and  strong  summer  thermocline.  Inclusion  of  syn¬ 
optic  summer  forcing  for  this  scenario  leads  to  trans¬ 
formation  values  several  times  larger  than  above,  about 
0(14)  Sv  at  <Tt  =  24.0.  The  peak  contribution  in  this 
case  is  almost  two  orders  of  magnitude  smaller  if  the 
synoptic  forcing  is  averaged  daily  and  the  diurnal  cycle 
is  not  resolved.  The  upper  bound  for  the  bxz  term  at 
the  MLML  experiment  site  is  0(3)Sv.  These  estimates 
are  likely  to  be  imderestimates  since  hydrographic  data 
average  over  isopycnal  meanders,  due  to  both  increase 
in  the  curvature  terms  and  an  increase  in  the  isopycnal 
length  (Figure  9).  More  details  are  presented  in  the  ap¬ 
pendix  of  Tandon  and  Zahariev  (2001).  Contribution 
due  to  mesoscale  and  sub-mesoscale  meanders  is  also 
conceptually  considered  later  in  more  detail  in  section 
4.2  of  this  paper. 

3.  Transformation  Due  to  Upwelling 
Near  Fronts 

While  the  bxz  term  was  found  to  be  small  in  Tandon 
and  Zahariev  (2001)  away  from  the  fronts,  this  need  not 
be  the  case  near  fronts.  Mesoscale  eddies  (wavelength 
~  500  km)  strain  the  surface  into  0(100  km)  tongues, 
which  leads  to  sub-mesoscale  features  of  20-50  km  di¬ 
ameter  which  induce  vertical  velocities  of  0(40  m/day) 
thereafter  (e.g.  Pollard  and  Regier  (1992)  for  Frontal 
Air-Sea  Interaction  Experiment  site,  Rudnick  (1996) 
and  Rudnick  and  Luyten  (1996)  for  the  Azores  front). 


WATER  MASS  TRANSFORMATION  DUE  TO  MIXED  LAYER  PROCESSES 


109 


Due  to  the  rapid  upwelling  near  the  fronts,  the  up- 
welling  at  the  base  of  the  mixed  layer  w  dt  dom¬ 

inates  the  deepening  dh  in  the  Fml  term.  Using 
typical  FASINEX  values  of  bxs  =  1‘5  x  10“^  s'"^  (for 
temperature  gradient  of  lK/20  km),  bxx  =  1*5  x 
m-is-2  and  Ab  =  10“^  m  s”^.  With  upwelling  of  0(40 
m/day)  along  half  the  length  of  a  convoluted  frontal 
length  of  3000  km,  the  annualized  transformation  val¬ 
ues  are  0(0.2  Sv)  for  the  bxx  term  and  0(5  Sv)  for  the 
bxz  term.  The  Fml  term  is  therefore  likely  to  have  a 
greater  contribution  from  bxz  term  and  upwelling  near 
sub-mesoscale  features. 

To  summarize,  while  Fml  estimates  are  0(5)Sv,  they 
are  particularly  sensitive  to  the  resolution  of  synoptic 
events  and  sub-surface  density  structure  near  fronts. 
It  is  also  reasonable  to  ask  whether  these  effects  are 
already  assumed  to  be  present  in  a  horizontal  diffu¬ 
sive  flux  due  to  mesoscale  stirring.  Inspired  by  Gar¬ 
rett’s  review  (this  volume)  as  well  as  Nakamura  (1996) 
and  Winters  and  D’Asaro  (1996),  we  next  ask,  under 
what  circumstance  can  these  two  processes  result  in  wa¬ 
ter  mass  transformation  independent  of  each  other  and 
when  must  they  be  considered  interdependent. 

4.  Mesoscale  Stirring  and 
Mesoscale-Mixed  Layer  Interaction 

Horizontal  stirring  due  to  mesoscale  eddies  and  sub¬ 
sequent  mixing  would  contribute  to  D{b),  and  sub- 
mesoscale  upwelling  contributes  to  Fml  {b) .  How  should 
the  distinction  be  made  between  water  mass  transfor¬ 
mation  due  to  these  two  processes?  How  do  we  ensure 
that  the  transformation  isn’t  being  diagnosed  multiple 
times  under  different  processes  which  are  in  series  (and 
hence  not  independent?  These  questions  are  clearly  re¬ 
lated  to  the  how  the  resolved  and  imresolved  processes 
are  defined. 

Recent  understanding  {Speer  et  al  1997;  Garrett 
and  Tandon  1997;  Marshall  et  al.  1999,  Nurser  et 
al.  1999)  has  concentrated  on  expanding  (2)  heuristi- 
cally  in  terms  of  the  physical  processes,  which  can  be 
written  as 

A{b)  =  Aeic  man  (6)  +  yle<idy(6)  + 

Amean  (b)  (5) 

—.TP  \  TP  dD^^fiy  dDixA 

-  F^,-se.  +  FML--J^ - ^  (6) 

with  (5)  for  the  dynamics  which  must  balance  (6)  for 
the  thermodynamics.  The  djmamical  processes  are 
the  diapycnal  voliime  flux  due  to  wind  forcing  (Ek- 
man),  mesoscale  diapycnal  volume  flux,  and  the  mean 
geostrophic  flow  across  isopycnals.  The  thermodynamic 
processes  are  the  air-sea  transformation  Fair~sea,  the 
contribution  due  to  mixed  layer  entrainment  Fml,  hor¬ 
izontal  mixing  due  to  eddies  dDeddy/db^  and  the  interior 


Figure  10,  An  illustration  showing  the  modified  con¬ 
trol  volume.  AEkman(i>)  refers  to  the  wind-driven  diapy¬ 
cnal  volume  flux  past  the  control  volume  surface  6,  while 
Abo1us(6)  or  Aeddy  (^)  refers  to  the  diapycnal  volume  flux 
due  to  the  mesoscale  eddies  past  this  control  surface. 
The  lower  control  surface  C  is  the  base  of  the  winter 
mixed  layer.  Dashed  vertical  lines  show  the  lateral  con¬ 
trol  surfaces  for  the  dynamic  method;  the  instantaneous 
isopycnal  positions  shown  as  solid  lines  form  the  lateral 
control  surfaces  for  the  thermodynamic  method. 

diapycnal  mixing  dDint/db.  This  procedure  is  naturally 
isopycnic/diapycnic  and  allows  comparison  of  these  di¬ 
verse  physical  processes  tmder  a  single  framework. 

The  terms  that  are  most  readily  calculated  from  cli¬ 
matological  data  in  the  above  framework  are  AEkman 
and  Fair-sea,  ^.g,  the  recent  evaluations  of  AEkman  and 
■^^r— sea  for  the  Southern  Ocean  by  Speer  et  al.  (1997). 
The  evaluation  of  AEkman  is  generally  done  follow¬ 
ing  isopycnal  surfaces  around  the  annual  cycle.  How¬ 
ever,  the  control  volmnes  for  the  dynamic  and  thermo¬ 
dynamic  methods  can  be  chosen  differently  as  long  as 
they  coincide  at  the  winter  mixed  layer  depth.  Thus, 
a  modified  control  volume  for  the  dynamic  calculation 
can  be  chosen  {Garrett  and  Tandon  1997)  such  that  this 
consists  of  the  volume  between  the  vertical  projections 
of  the  isopycnal  surfaces  below  the  winter  mixed  layer 
to  the  surface  (Figme  10).  Therefore,  this  modified  con¬ 
trolled  volume  does  not  change  much  during  the  annual 
cycle. 

4.1.  The  triple  decomposition  into  mean,  eddy, 
and  turbulent  flow 

While  choosing  a  fixed  control  volume  as  described 
above  simplifies  the  diagnostic  evaluations  of  the  dy¬ 
namic  terms  in  (5),  the  issues  concerning  the  decompo- 
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sition  and  overlap  of  the  thermodynamic  terms  in  (6) 
still  remain.  To  clarify  the  role  of  horizontal  stirring 
dDn/db  and  mixed  layer  entrainment  Fml^  following 
Davis  (1994)  and  Garrett  (this  volume),  we  apply  the 
triple  decomposition  to  the  water  mass  transforma¬ 
tion  ideas.  As  these  authors  stress,  such  a  decompo¬ 
sition  necessarily  presupposes  a  gap  in  time  (or  space) 
scales  such  that  a  three-way  decomposition  can  be  de¬ 
fined.  For  the  water  mass  transformation  diagnostics 
the  buoyancy  field  is  split  into  the  mean,  eddy,  and  tur¬ 
bulent  parts.  The  diapycnal  advection  Un  is  also  split 
into  three  parts,  each  representing  the  mean,  eddy,  and 
turbulent  advection  (subscripts  m,e,  and  t  respectively) 
across  the  mean  buoyancy  surface,  i.e, 

^  =  bm  bc-\-  hi  and  (7) 

Wn(i>m)  =  +  Une{bm)  +  (8) 

Here  we  envisage  the  mean  buoyancy  field  bm  to  be 
the  modified  mean  buoyancy  field  {Killworth  2001,  Me- 
Doug  all  and  McIntosh  2001)  although  for  the  large  scale 
water  mass  transformation  diagnostics  the  difference 
with  Eulerian  mean  buoyancy  field  is  probably  small. 
We  denote  (  )  for  averaging  over  a  time  scale  large  com¬ 
pared  to  turbulence  but  small  compared  to  that  of  the 
eddies  and  <  (  )  >  for  averaging  over  a  time  scale  large 
compared  to  that  of  the  eddies  but  short  compared  to 
that  of  the  mean.  Substituting  into  the  instantaneous 
buoyancy  equation  and  subsequent  averaging  over  the 
turbulent  time-scailes  and  the  eddy  time  scale  yields 

^bjYi  dbc.  dbt 

if  the  mean  flow  is  considered  steady  at  long  time-scales. 
The  right  hand  side  of  eq.(9)  is  the  term  and  the 
terms  on  the  left  hand  side  are  mean  and  eddy  advec¬ 
tion  of  mean  and  eddying  isopycnals  respectively.  This 
equation  is  subject  to  the  surface  boundary  condition 

<  Webe  >  +  <  Wfbt  >=  Bo  (10) 

thus  suggesting  that  the  surface  buoyancy  flux  is  bal¬ 
anced  by  both  the  vertical  buoyancy  flux  due  to  eddies 
and  turbulent  flux  at  the  surface.  Garrett  (this  volume) 
discusses  the  buoyancy  variance  budget  for  this  decom¬ 
position. 

4.2.  Mesoscale  transformation  and  the 
entrainment  contribution;  In  series  or  parallel? 

If  the  diagnostics  are  done  in  a  framework  that  does 
not  resolve  eddies,  then  the  transformation  relationship 
can  be  written  in  terms  of  buoyancy  balance  about  bm  : 

A{bm)  —  “^mean  (^m)  “i"  ‘^eddy(^m)  -^Ekman(^m)(ll) 


=  Fibm)-^{bm)~FuL{bm)-^ibm) 

(12) 

where  the  contribution  ^(6m)  is  due  to  the  conver¬ 
gence  of  the  eddy  stirring  horizontal  diffusive  flux  and 
FiAhibrn)  is  the  contribution  to  water  mass  transfor¬ 
mation  due  to  mixed  layer  deepening  and  entrainment, 
both  evaluated  about  the  mean  isopycnal  positions. 

Following  the  triple  decomposition  suggested  above, 
if  the  eddies  are  resolved  in  this  framework,  the  water 
mass  transformation  balance  becomes 

A{bm)  =  Amean(tm)  +  Aeddy(&m)  +  AEkman(^>m)  (13) 

=  F{bm  -f  6e)  -  -pML(tm  +  be)  -  ^^{bm  +  ^c) 

do 

(14) 

and  the  eddy-stirring  component  does  not  arise. 
The  contributions  in  eq,  (11)  and  eq.  (13)  are  the  same 
due  to  the  choice  of  control  volume  in  Figure  10.  How¬ 
ever,  the  contributions  in  eq.  (12)  and  eq.  (14)  are 
different.  In  this  case,  the  diapycnal  advection  due  to 
horizontal  mixing  by  the  eddies  and  the  contribution 
due  to  mixed  layer  entrainment  are  in  parallel  if  the 
budgets  are  considered  about  the  mean  buoyancy  sur¬ 
faces  (^Tn)  that  do  not  resolve  the  eddies  as  in  eq.  (12). 
However,  these  two  processes  are  in  series  if  budgets 
are  considered  for  eddying  buoyancy  surfaces  {bm  +  &e) 
as  in  eq,  (14),  where  eddying  isopycnals  are  defined  by 
modified  mean  density  that  averages  over  a  time  scale 
long  compau'ed  to  the  turbulence  but  short  compared  to 
the  eddies.  Thus  the  F^l  estimates  computed  by  Tan- 
don  and  Zahariev  (2001)  (section  2)  which  are  based  on 
mean  monthly  isopycnals  that  average  over  the  eddies 
correspond  to  the  estimates  for  FML(im),  while  the  es¬ 
timates  earlier  near  frontal  zones  (section  3)  correspond 
to  the  latter  case  FMiibm  +  fre)- 

5.  Discussion 

The  uncertainty  in  Fj^j^  remains  unacceptably  high 
in  both  model  based  estimates  of  coarse  non-eddy  re¬ 
solving  simulations  (e.g.  Nurser  et  al  1999)  and  data 
based  evaluations  which  are  based  on  localized  mixed 
layer  simulations  {Tandon  and  Zahariev  2001).  More 
accurate  basin  wide  estimates  are  necessary  to  make 
progress.  While  considering  budgets  in  an  eddy  resolv¬ 
ing  framework,  the  water  mass  transformation  due  to 
eddies  happens  in  series  with  diapycnal  mixing  and  en¬ 
trainment.  In  this  case,  another  significant  concern  is 
identifying  the  rate  controlling  process  for  diapycnal 
transformation  in  the  upper  ocean — do  the  mesoscale 
eddies  define  the  rate  at  which  net  diapycnal  advection 
will  happen  or  does  it  primarily  depend  on  the  rate  at 
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which  diapycnal  mixing  occurs?  The  mesoscale  stirring 
would  be  dominant  for  water  mass  transformation  if 
the  mixing  processes  are  so  efficient  in  the  upper  ocean 
that  they  act  to  dissipate  the  eddies  quickly  and  trans¬ 
form  water  masses  on  a  fast  time  scale.  In  this  case, 
it  is  the  baroclinic  instability  and  larger  scales  that 
would  set  the  net  diapycnal  advection  rates.  On  the 
other  hand  it  is  possible  that  the  mixed  layer  processes 
allow  non-linear  interactions  amongst  eddies,  and  the 
net  diapycnal  advection  is  then  significantly  dependent 
on  mixing  processes  in  the  upper  ocean.  While  recent 
eddy  resolving  simulations  have  concentrated  on  adi¬ 
abatic  properties  of  mesoscale  paramaterizations  (e.g., 
articles  by  Marshall,  Killworth,  and  McDougall  from 
this  volume)  diabatic  interactions  in  the  upper  ocean  re¬ 
main  imexplored,  and  the  question  above  remains  unan¬ 
swered.  Determining  which  process  is  dominant  for  wa¬ 
ter  mass  transformation  rate  should  be  feasible  with 
multiple  eddy-resolving  simulations  whose  sensitivity  to 
mixing  in  the  upper  ocean  and  air-sea  interaction  is  ex¬ 
plored  systematically. 
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Abstract.  Differences  in  the  response  times  of  different  biogeochemical 
tracers  to  upper  ocean  forcing  processes  can  lead  to  substantial  differences 
in  their  patterns  of  variability.  An  idealized  tracer  with  a  steep  vertical 
concentration  gradient  is  modeled  in  a  flow  field  representative  of  an  upper 
ocean  front.  The  surface  distribution  of  the  tracer  is  studied  as  a  function  of 
its  response  or  restoration  time  scale. 


Introduction 

Many  quantities,  such  us  temperature,  salinity,  dis¬ 
solved  inorganic  carbon  (DIG),  nitrate,  phosphate,  ra¬ 
diocarbon  and  dissolved  oxygen,  have  strong  vertical 
concentration  gradients  in  the  upper  500  or  so  meters  of 
the  ocean.  These  substances  are  often  forced  in  the  up¬ 
per  ocean,  and  vertical  mixing  and  transport  processes 
that  convey  the  resulting  concentration  changes  to  the 
ocean  interior  are  extremely  small.  Horizontal  advec- 
tion,  on  the  other  hand,  is  relatively  ejfficient  at  mixing 
and  homogenizing  substances,  and  hence  concentration 
gradients  are  relatively  weak  in  the  horizontal.  Anoma¬ 
lous  concentrations  of  many  biogeochemical  quantities 
in  the  upper  ocean,  are  therefore,  largely  a  result  of 
upwelling  that  brings  to  the  surface,  water  with  a  sub¬ 
stantially  different  concentration  than  its  surroundings. 
The  substances  may  also  be  altered  by  processes  like 
biological  production,  remineralization,  and  air-sea  ex¬ 
change.  Such  processes  could  induce  horizontal  variabil¬ 
ity,  but  the  variability  induced  by  upwelling  is  typically 
smaller  scaled,  more  intermittent,  and  responsible  for 
most  of  the  variance  in  the  sea-surface  distributions  of 
the  biogeochemical  quantities. 

In  an  earlier  modeling  study  (Mahadevan  and  Archer^ 
2000),  it  was  observed  that  the  nutrient  upwelling  rate 
in  the  pelagic  ocean  is  very  sensitive  to  model  resolu¬ 
tion.  Increasing  the  model  resolution  from  0.4  degrees 
to  0.1  degrees  in  a  domain  situated  in  the  interior  of  sub¬ 
tropical  gyre,  resulted  in  a  two-  to  three-fold  increase 
in  not  only  the  mean  rate  of  nutrient  supply,  but  also 
in  the  variance  of  the  field.  The  mean  and  variance 
of  other  model  variables  like  temperature  and  pC02  at 
the  sea  surface  are  not  as  sensitive  to  model  resolution 
as  new  production.  The  density  field  in  the  coarse  and 
fine  resolution  model  runs  looks  very  similar  in  its  mean 
and  variance,  but  the  fine  resolution  simulation  shows 
more  folds  in  the  isopycnal  surfaces  and  greater  convo¬ 
lutedness  in  the  frontal  outcrop  formed  by  the  intersec¬ 


tion  of  such  density  surfaces  with  the  euphotic  layer. 
The  transport  of  nutrients  from  the  thermocline  into 
the  mixed  layer  is  primarily  an  isopycnal  process.  Re¬ 
solving  the  length  of  the  front  at  which  the  flow  fleld 
is  also  highly  strained,  is  thus  extremely  important  for 
capturing  the  vertical  transport  which  occurs  on  scales 
of  the  order  of  10  km  or  less. 

Even  though  all  the  biogeochemical  tracers  in  such  a 
model  are  affected  by  vertical  motion,  the  surface  dis¬ 
tribution  of  some,  for  example  oxygen  and  H2O2,  ap¬ 
pears  more  fine  scaled,  i.e.,  there  is  more  variance  at 
a  given  length  scale  than  in  others  like  DIG  and  tem¬ 
perature.  While  there  are  many  reasons  for  differences 
in  the  distributions  of  different  tracers,  I  believe  that 
this  is  largely  a  result  of  the  difference  in  the  response 
times  of  the  tracers  to  various  forcing  processes.  Sub¬ 
stances  that  respond  quickly  to  upper  ocean  forcing  ap¬ 
pear  in  small  and  intermittent  anomalous  patches  rep¬ 
resentative  of  the  upwelling  scale.  When  substances 
are  slow  to  adjust,  the  anomalous  patches  introduced 
by  upwelling  are  stirred  and  spread  by  horizontal  ad- 
vection.  Hence  the  distribution  of  such  substances  is 
not  as  fine-scaled.  Garbon  dioxide,  for  example,  is  very 
slow  to  equilibrate  with  the  atmosphere  on  account  of 
the  buffering  effect.  While  the  oxygen  in  a  100m  deep 
mixed  layer  may  typically  take  one  month  to  equilibrate 
with  the  atmosphere,  the  DIG  would  take  about  a  year. 
The  surface  distribution  of  oxygen  is,  as  a  result,  much 
more  patchy  or  intermittent  than  DIG. 

Modeling  Study 

The  change  in  distribution  of  a  tracer  c  in  the  ocean 
can  be  modeled  using  the  equation 

dc  ^  ^ 

Q^+n-Vc- S,  (1) 

where  u  is  the  velocity  of  the  fluid  and  5  is  a  source 
or  sink  term.  S  may  parameterize  processes  hke  air- 
sea  exchange  and  biological  production  that  alter  the 
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Figure  1.  Snapshots  of  the  surface  density  field  in  the 
model  shown  on  days  46,  69  and  93  of  the  simulation  help 
visualize  the  evolving  flow  field.  The  flow  is  from  west  to 
east  along  the  the  density  contours. 


quantity  c.  Often  S  can  be  expressed  in  terms  of  a 
characteristic  response  time  or  adjustment  time.  In  the 
case  of  salinity  and  temperature,  S  could  represent  the 
effect  of  the  heat  or  evaporation-precipitation  flux,  and 
may  be  expressed  as  A“^(c  — Cq),  where  c  is  the  temper¬ 
ature  or  salinity,  cq  is  the  value  to  which  it  is  restored 
at  the  surface,  and  A  is  the  restoration  time  scale.  In 
the  case  of  nutrients,  S  may  be  written  as  A *“^00,  where 
Co  may  be  a  combination  of  the  nutrient  and  light,  and 
A  a  biological  uptake  time.  In  the  case  of  gases  that 
are  subject  to  air-sea  exchange  at  the  ocean  surface,  S 
would  represent  the  gas  flux,  for  which  the  characteris¬ 
tic  time  scale  A  is  the  ratio  of  the  mixed  layer  depth  to 
piston  velocity.  Thus  the  quantification  of  the  source  or 
sink  term  S  in  terms  of  an  adjustment  time  A  is  general 
to  many  tracers  and  processes. 

In  what  follows,  we  use  a  three-dimensional  numer¬ 
ical  model  of  an  idealized  ocean  front  to  analyze  the 
surface  distribution  of  a  tracer  as  a  function  of  its  re¬ 
sponse  time  A.  The  tracer  c  is  qualitatively  similar  to 
other  tracers  in  the  ocean;  it  has  a  steep  concentra¬ 
tion  gradient  in  the  upper  500  m.  It  is  modeled  using 
equation  (1)  which  is  coupled  to  the  dynamical  model 
described  in  Mahadevan  et  al.  (1996).  The  source  term 
on  the  right  right  hand  side  of  (1)  is  modeled  as 

0,  \iz<  Zm 

A”^[c-(l-exp(-^^))],  iiz>Zm-  ^ 

Here  z  is  the  depth  below  the  surface,  Zm  can  be  thought 
of  as  a  mixed  layer  or  euphotic  layer  depth,  and  zq  is  a 
reference  depth.  In  the  absence  of  advection,  the  tracer 
has  the  steady  state  profile  c  =  1  — exp(— (z-Zm)/^o)  at 
depths  greater  than  z^,  and  a  value  of  0  at  depths  less 
than  Zm  •  The  tracer  can  thus  be  thought  of  as  depleted 
in  the  euphotic  layer  (z,  z^),  and  beneath  it,  increasing 
exponentially  with  depth  so  as  to  approach  1  over  an 
e- folding  distance  zq  Zm  •  The  euphotic  depth  Zm  and 
e-folding  distance  zq  ^  Zm  are  each  taken  to  be  100  m 
in  these  experiments.  Advection  disturbs  this  steady- 
state  distribution  to  which  it  is  constantly  restored  with 
a  characteristic  adjustment  time  A.  When  A  is  small, 
values  of  the  tracer  anomalous  from  the  steady-state  ex¬ 
ponential  distribution  are  rapidly  annihilated.  When  A 
is  large,  the  anomalous  values  that  are  introduced  pri¬ 
marily  by  vertical  motion,  persist  for  longer,  are  rein¬ 
forced  by  further  upwelling,  and  are  spread  horizontally 
by  horizontal  advection.  Diffusion  is  negligible  at  the 
scales  considered  and  is  not  explicitly  included  in  the 
model. 

The  model  is  set  up  in  a  periodic  channel  with  solid 
north-south  boundaries.  The  domain  is  approximately 
256x284  km  in  dimension  and  centered  at  25  °N.  It  is 
initialized  with  lighter  fluid  in  the  upper  100  m  of  the 
southern  half  and  denser  fluid  in  the  upper  northern 


BIOGEOCHEMICAL  VARIABILITY 


115 


Figure  2.  The  tracer  concentration  in  the  upper  100m 
viewed  on  day  69  of  the  simulation  (middle  panel  of  Fig.  1) 
for  three  different  values  of  the  adjustment  time  A:  2.5,  20 
and  80  days.  The  tracer  distribution  is  more  patchy  and 
the  tracer  concentration  less  for  smaller  values  of  A.  The 
concentration  is  displayed  as  a  percentage  of  the  maximum 
concentration,  i.e.  it  is  multiplied  by  100. 


Figure  3.  The  variance  V (L)  of  the  surface  distribution  of 
tracer  shown  in  Fig.  2  plotted  against  wavelength  (inverse 
of  the  length  scale  L)  on  log-log  axes  for  diflFerent  values  of 
A.  V (L)  is  the  average  variance  contained  in  regions  of  area 
L  X  L.  Smaller  values  of  A  result  in  more  variance  within 
a  given  length  scale  and  a  shallower  curve  in  the  log-log 
space.  The  relationship  between  V{L)  and  A  seen  in  this 
distribution  is  qualitatively  the  same  at  other  times  in  the 
simulation. 


half.  Beneath  this  surface  layer,  the  density  in  both 
halves  converges  to  a  mean  stratification  that  is  repre¬ 
sentative  of  the  thermocline.  The  model  resolution  is 
0.04°  (approximately  4  km)  in  the  horizontal  and  varies 
in  the  vertical  from  10  m  near  the  surface  to  125  m  at 
depth.  The  tracer  is  initialized  to  its  steady  state  distri¬ 
bution  and  allowed  to  evolve  with  the  flow  field.  We  use 
five  different  values  of  A:  2.5,  10,  20,  40  and  80  days, 
and  observe  differences  in  the  surface  distributions  of 
the  tracer  as  a  function  of  A. 

Results 

The  initial  density  front  gives  rise  to  a  geostrophic 
jet  in  the  along-front  direction.  The  jet  and  the  front 
meander  as  the  flow  evolves  giving  rise  to  a  more  com¬ 
plicated  eddy  field  that  gradually  decays  (Fig.  1).  The 
surface  distribution  of  the  tracer  that  evolves  with  this 
flow  field  varies  with  A.  It  is  more  patchy  and  has  more 
variance  at  a  given  length  scale  for  smaller  values  of  A 
(see  Fig.  2).  The  tracer  distributions  are  analyzed  by 
calculating  the  variance  as  a  function  of  length  scale 
[J.W.  Campbell^  personal  communication].  The  vari¬ 
ance  is  computed  over  the  whole  domain,  and  then  over 
subregions  of  size  L,  by  partitioning  and  re-partitioning 
the  original  domain.  V {L)  is  the  average  variance  over 
all  the  subdomains  of  size  L  that  occupy  the  whole  do- 
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Figure  4.  Vertical  profiles  of  the  tracer  at  a  late  stage  (day 
93)  of  the  simulation  averaged  over  the  domain  for  different 
values  of  A.  The  upper  ocean  tracer  concentration  increases 
(at  a  less  than  linear  rate)  with  increasing  A. 

main.  V{L)  is  normalized  by  the  total  variance  and 
plotted  as  a  function  of  the  length  scale  L  on  a  log-log 
plot  (see  Fig.  3).  We  obtain  shallow  curves  that  have 
small  slope  when  A  is  small  and  steeper  curves  for  large 
A.  The  approximate  slope  of  these  curves,  denoted  by 
p,  is  a  measure  of  the  intermittency  or  patchiness  of  the 
tracer  distribution.  Small  values  of  p  correspond  to  a 
more  intermittent  or  patchy  distribution  with  a  large 
amount  of  variance  at  a  given  length  scale.  This  kind 
of  distribution  results  from  rapid  adjustment  or  small 
A.  The  converse  is  true  for  large  values  of  p  and  A.  The 
interesting  thing  is  that  the  increase  in  p  scales  roughly 
as  the  logarithm  of  A. 

The  same  is  true  for  the  average  anomalous  concen¬ 
tration  of  tracer.  Fig.  4  shows  the  mean  vertical  distri¬ 
bution  of  the  tracers  with  different  A  towards  the  end  of 
the  simulation.  The  total  amount  of  tracer  in  the  upper 
layer  {z  <  Zm)  increases  with  A,  but  the  increase  scales 
roughly  as  the  logarithm  of  A  for  reasons  that  are  not 
well  understood. 

balance 

Conclusions 

Thus  it  is  shown  that  the  distributions  of  biogeo¬ 
chemical  quantities  in  the  upper  ocean  are  sensitive  to 


their  time  scale  of  response  to  various  forcing  processes. 
This  ought  to  be  taken  into  account  when  trying  to  de¬ 
sign  algorithms  that  quantify  the  surface  distribution  of 
one  biogeochemical  quantity  in  terms  of  another.  The 
patchiness  or  intermittency  of  the  distribution,  quan¬ 
tified  here  in  terms  of  the  approximate  slope  p  of  the 
variance  curve  in  log- log  space,  increases  more  or  less 
logarithmically  with  the  adjustment  time  A. 
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Abstract.  We  examine  the  effect  of  stirring  and  mixing  on  the  marine 
planktonic  ecosystem.  We  present  a  number  of  models  investigating  the 
impact  of  patchiness  in  the  population,  diffusion,  and  fluid  stirring  on  the 
spatial  structure  of  the  population  and  the  overall  production  rates  of  the 
system.  It  is  found  that  the  heterogeneity  of  the  system  significantly  affects 
the  way  the  system  behaves.  The  response  of  the  ecosystem  to  the  action  of 
the  fluid  flow  is  a  function  of  not  only  the  topology  of  the  flow  but  also  the 
dynamics  of  the  ecological  model  itself. 


1.  Introduction 

The  marine  ecosystem  plays  an  important  role  in  the 
cycling  of  carbon.  In  particular  it  acts  as  a  ‘carbon 
pump,’  utilising  atmospheric  CO2  through  photosyn¬ 
thesis,  a  proportion  of  the  carbon  being  exported  to 
the  deep  ocean,  and  thus  moderating  levels  of  CO  2  in 
the  atmosphere.  Ecological  models  are  now  incorpo¬ 
rated  into  models  of  climate  change  to  predict  global 
wanning  scenarios  (see,  e.g.,  it  Cox  et  aL,  2000).  A 
physical/ecological  model  used  in  climate  studies  needs 
to  incorporate  both  the  appropriate  ecological  dynam¬ 
ics  and  the  right  physics  impacting  on  the  biological 
production. 

Primary  production  in  the  ocean  is  dominated  by 
phytoplankton.  A  feature  of  the  distribution  of  phyto¬ 
plankton  distributions  is  that  it  is  very  heterogeneous 
or  ‘patchy’  {Bainbridge^  1957).  Structure  is  observed 
on  scales  ranging  from  metres  to  the  basin  scale  {Mann 
and  Lazier^  1996)  with  this  structure  often  observed  to 
be  associated  with  physical  features  such  as  eddies  and 
fronts  (e.g.,  {The  Ring  Group^  1981;  FaUcowski  et  aZ., 
1991;  Sirass^  1992).  A  number  of  important  questions 
arise: 

•  Does  the  patchiness  affect  the  dynamics  of  the 
ecological  system? 

•  Does  the  heterogeneity  of  the  system  impact  on 
the  overall  production  rates  and/or  community 
structure? 

•  Does  the  structure  affect  the  way  the  biological 
system  responds  to  changes  to  the  environment? 


damental  issues  related  to  the  problem  of  marine  bio¬ 
logical  patchiness.  The  models  should  be  thought  of  as 
caricatures  of  the  more  complex  system,  but  we  hope 
they  will  provide  the  building  blocks  for  understanding 
of  that  system.  The  work  is  very  much  ‘in  progress’ 
and  so  this  not  a  presentation  of  a  ‘complete  theory’ 
but  rather  some  initial  attempts  and  their  results. 


2.  Biological  reactions  and  fluid  flow 

Fluid  flow  can  impact  on  the  marine  ecological  sys¬ 
tem  in  two  distinct  ways:  (a)  through  the  vertical  move¬ 
ment  of  nutrients  and  biological  species  in  or  out  of 
the  euphotic  zone,  and  (b)  the  stirring  and  mixing  of 
components  of  the  system.  A  good  example  of  when 
both  are  important  is  baroclinic  instability  {Spall  and 
Richards,  2000),  Associated  with  the  instability  process 
is  a  permanent  vertical  displacement  carrying  nutrient 
rich  waters  into  the  sunlit  surface  layers.  These  nutri¬ 
ents  are  then  stirred  and  mixed  with  the  surroimding 
water  and  are  utilised  by  the  biota.  Here  we  will  con¬ 
centrate  on  this  second  process,  the  stirring  and  mixing 
by  lateral  fluid  flow. 


2.1.  General  advection-diffusion-reaction 
equation 

We  shall  model  the  ecological  system  using  the  advec- 
tion-diffusion-reaction  equation 

9E 

—  +  (u  •  V)  E  =  f (E)  +  V-  (DVE) 
where,  for  example,  the  state  vector  may  take  the  form 


E  = 


N  ' 

P 

Z 


Here  we  will  use  a  number  of  relatively  simple  mod¬ 
els  of  both  the  ecological  dynamics  and  the  fluid  flow 
in  order  to  try  and  understand  some  of  the  more  fun- 
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with  the  elements  representing  the  concentration  of  nu¬ 
trients  (N),  phytoplankton  (P)  and  zooplankton  (Z) 
expressed  in  terms  of  a  currency,  such  as  nitrogen. 

The  elements  are  advected  by  the  flow  u  and  dif¬ 
fused.  The  reaction  between  elements  is  represented  by 
f(E).(How  well  the  field  equation  approach  models  the 
marine  ecological  system  is  a  matter  of  debate.  There 
is  some  evidence  that  it  fails  if  behavioinral  aspects  of 
the  zooplankton  are  important  [Flierl  et  at,  1999). 

A  measrire  of  the  relative  ‘strength’  of  the  reaction 
terms  to  either  diffusion  or  advection  can  be  gauged  by 
considering  the  ratio  of  timescales,  i.e. 

K 

and 


as  Holling  type  III).  The  zooplankton,  Z,  grow  at  a  re¬ 
duced  rate  7  1.  The  mortality  term  h{Z)  is  taken 

to  be  either  a  linear  or  quadratic  function  of  Z,  corre¬ 
sponding  to  the  zooplankton  dying  of  old  age  or  being 
consumed  by  a  higher  predator,  respectively. 


f’hyli'pIankKm 


<1^  0:.  0.4  0  5 


respectively,  where  U  and  L  are  typical  velocity  and 
length  scales,  k  a  diffusion  coefficient,  and  p  the  reaction 
rate.  The  nondimensional  parameters  0^.  and  j3u  are 
equivalent  to  diffusive  and  advective  inverse  Damkohler 
numbers.  Large  values  imply  that  the  reaction  domi¬ 
nates. 

We  present  three  models  using  (2.1).  The  first  looks 
at  the  case  of  no  flow  and  the  structure  that  evolves  sim¬ 
ply  through  the  combination  of  reaction  and  diffusion. 
The  second  considers  simple  biology  in  a  simple  flow,  a 
pure  strain,  and  examines  the  structure  of  filaments  of 
biological  material.  The  third  takes  a  2D  geostrophic 
turbulent  flow  and  investigates  the  impact  the  stirring 
has  on  biological  production.  The  underlying  message 
is  that  the  response  of  the  ecology  to  the  action  of  the 
fluid  flow  is  a  function  of  not  only  the  topology  of  the 
flow  but  also  the  dynamics  of  the  ecological  model  itself. 

2.2.  Model  1;  Intrinsic  structure 

Consider  the  case  of  no  flow  (u  =  0).  We  shall  con¬ 
sider  a  two  compartment  model: 


where  P  €  [0, 1],  and  the  reaction  terms  are 
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Figure  1.  Model  1  with  linear  mortality.  Upper  panel, 
plot  of  phytoplankton  concentration  as  a  function  of  distance 
(given  in  tens  of  metres)  and  time.  Lower  panel,  trajectories 
of  the  mean  values  of  P  and  Z  for  the  randomly  perturbed 
(crosses)  and  unperturbed  (dots)  cases.  Also  shown  are  the 
null-clines  of  the  reaction  equations  (i.e.,  dPjdt  =  0  and 
dZjdt  ~  0.  The  null-clines  intersect  giving  a  single  stable 
equilibrium  point 

The  model  for  the  reaction  terms  is  that  of  Truscott 
and  Brindley  (1994).  It  is  chosen  because  of  its  sim¬ 
plicity  and  because  the  mathematical  structure  of  the 
ecological  dynamics  is  known.  In  particular  the  system 
becomes  excitable  (i.e.,  small  but  finite  perturbations 
from  an  equilibrium  point  can  lead  to  large  excursions 
in  model  phase  space)  if  both  7  and  v  are  small  enough. 

The  diffusion  tensor  is  taken  to  be 


f  = 


The  reaction  terms  for  the  phytoplankton,  P,  include 
limited  growth  (known  as  logistic  growth)  and  grazing 
by  zooplankton  (the  fimctional  form  used  here  is  known 


so  that  zooplankton  may  diffuse  at  a  different  rate  to 
phytoplankton  (allowing  cross  diffusion,  i.e.,  zooplank¬ 
ton  diffusing  up  the  phytoplankton  gradient,  has  also 
been  investigated). 
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Here  we  present  two  examples  to  illustrate  the  dif¬ 
ferent  behaviour  of  the  system  for  linear  (Figure  1)  and 
quadratic  (Figure  2)  mortality  (further  details  of  the 
behaviour  of  the  system  are  given  in  Brentnall  et  d. 
(2001).  We  take  a  one-dimensional  case,  i.e.,  variables 
only  vary  in  one  horizontal  direction.  The  system  is  ini¬ 
tialised  on  a  grid  with  random  values  of  P  and  Z  about 
some  mean  value.  The  size  of  the  rectangular  box  in 
the  lower  panel  of  each  figxire  gives  an  indication  of  the 
spread  of  values.  In  both  the  linear  and  quadratic  mor¬ 
tality  cases  a  structure  appears.  The  difference  is  that 
this  structure  persists  for  a  longer  time  for  the  quadratic 
mortality  case.  The  length  scale  of  the  structure  scales 
with  -y/JijK  (i.e.,  the  diffusive  inverse  Damkohler  num¬ 
ber,  /3«  is  the  relevant  parameter).  With  a  growth  rate 
of  1  day'“^  and  diffusion  coefficient  of  10”^  m^s“^  the 
emergent  scale  is  of  order  500  m. 


I'hvtof'lajnktt'n 
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Figure  2.  Model  1  with  quadratic  mortality.  As  for  Figure 
1,  but  in  this  case  the  nuil--clines  intersect  at  three  points, 
with  two  stable  equilibria  at  P  =  0.05  and  0,7  and  an  un¬ 
stable  equilibritim  point  at  P  =  0.27 


the  perturbation  is  introduced.  With  quadratic  mortal¬ 
ity,  the  two  trajectories  converge  to  two  different  sta¬ 
ble  equilibria,  with  the  perturbed  case  converging  very 
slowly.  Clearly  the  heterogeneity  of  the  biology  (with 
diffusion  acting)  is  playing  a  significant  role  in  the  time 
evolution  of  the  system.  It  is  also  clear  that  we  need  to 
consider  what  happens  when  there  is  a  flow,  which  we 
do  in  the  next  section 

2.3.  Model  2:  filaments 

We  now  consider  a  pmre  strain  flow 

u=  (-AxjAy) 

Such  a  flow  win  tend  to  pull  a  tracer  into  a  thin  filament. 
For  an  inert  tracer  the  filament  width  attains  a  value 


when  the  straining  of  the  flow  is  balanced  by  diffusion. 
(34  (@)  has  shown  that  filaments  of  an  exponentially 
growing  tracer  have  the  same  width  as  an  inert  tracer) 

The  biology  we  will  use  is  a  simplified  version  of  that 
used  above,  namely 

E=[P] 

with 

f=[/4P(l-P)] 

A  feature  of  this  system  is  that  the  combination  of 
logistic  growth  and  diffusion  allows  the  existence  of  re¬ 
active  travelling  waves  (or  Fisher  waves)  which  have 
a  minimum  speed  2y/f^  (c.f.,  Murray^  1993).  These 
weaves  are  important  in  transmitting  ‘information’  and 
means  that  the  biological  population  can  advance  faster 
than  through  pure  diffusion.  The  waves  are  important 
in  setting  the  width  of  filaments. 

We  may  expect  the  travelling  wave  to  be  brought  to 
rest  when  its  speed  matches  that  of  the  converging  flow, 
such  that  the  width  of  the  filament 


The  trajectory  in  P/Z  phase  space  changes  signif¬ 
icantly  when  horizontal  structure  is  introduced.  The 
trajectories  for  the  mean  P  and  Z  with  and  without 
the  small  random  perturbation  are  shown  in  the  lower 
panel  of  Figures  1  and  2.  With  linear  mortality,  al¬ 
though  the  two  trajectories  converge  to  the  same  stable 
equilibrium,  the  excursion  in  P  is  much  greater  when 


Indeed  this  is  found  to  be  the  case  for  sufficiently 
large  values  of  Pu  =  A,  an  inverse  advective  Damkohler 
number  (see  the  lower  panel  of  Figure  3).  The  shape  of 
the  distribution  of  concentration  changes  with  increas¬ 
ing  (upper  panel  Figure  3)  from  a  Gaussian  to  a 
square  wave  as  shown  by  the  kurtosis  of  the  distribu¬ 
tion  (lower  panel  Figure  3)  {McLeod  et  aZ.,  2001). 
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Figure  3.  Model  2,  filament  structure.  Upper  panel:  the 
distribution  of  phytoplankton  across  the  filament  for  three 
values  of  /?„;  dash-dot  /3„=2,  dash  0u=lO  and  solid  0„=5O. 
The  horizontal  distance  has  been  scaled  with  ./njl/X.  Lower 

panel:  the  filament  width  (scaled  with  y/it/X)  and  kurtosis 
as  a  function  of  0^- 


potantial  vortictty 


Figure  4.  Model  3,  fluid  stirring.  The  potential  vorticity 
field. 


The  biological  model  is  slightly  more  complicated 
than  before. 


with 


It  is  interesting  to  note  that 


Wi 

J 

i.e.,  the  filament  width  for  a  reactive  tracer  will  be 
greater  than  that  of  an  inert  tracer  for  /?„  >  1.  Using  an 
observed  filament  width,  and  strain  rate,  to  estimate  k 
will  lead  to  an  overestimate  by  a  factor  0  if  the  reaction 
is  not  taken  into  account. 


2.4.  Model  3:  fluid  stirring 

Lastly  we  consider  the  case  of  the  ecosystem  be¬ 
ing  stirred  by  a  turbulent  flow  field.  We  use  the  2D 
geostrophic  turbulence  model  of  Babiano  et  al.  (1987) 
in  a  512  km^  periodic  domain.  The  flow  is  forced  to 
give  a  statistically  steady  state.  An  example  of  the  po¬ 
tential  vorticity  is  given  in  Figure  4.  A  characteristic 

2D  turbulence  is  that  the  flow  is  inhabited  by  strong 
coherent  vortical  structures  surrounded  by  a  straining 
flow  in  the  intervening  regions.  The  coherent  structures 
act  as  transport  barriers  inhibiting  the  mixing  of  tracers 
between  the  eddies  and  surrounding  fluid  (Provenzale, 
1999). 


(1  -  7i)  Z  +  fipP  -I-  fizZ^  -  fijoD 

where  D  represents  detritus  and  there  is  an  additional 
sinking  term  --WsD/h  in  the  equation  for  detritus.  The 
biological  model  is  that  of  Oschlies  and  Garcon  (1999) 
and  has  been  chosen  because  it  has  been  shown  to  ca¬ 
pable  of  reproducing  the  seasonal  cycle  in  both  olig- 
otrophic  (nutrient  depleted)  and  nonoligotrophic  re¬ 
gions  with  the  same  set  of  parameter  values.  Details 
of  the  biological  and  flow  model,  including  the  parame¬ 
ter  values,  are  given  in  Martin  et  al  (2001). 

The  biological  system  is  forced  by  a  source  of  nutrient 
r.  -  s{No  -  N) 

where  s  is  a  function  of  position  or  some  property  of  the 
flow,  and  can  be  thought  of  modelling  the  input  of  nu¬ 
trients  to  the  euphotic  zone  through  upwelling.  Martin 
et  al  (2001)  investigate  how  the  productivity  depends 
upon  the  distribution  of  the  upwelling  by  comparing 
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spatially  stationary  sources  with  sources  correlated  or 
decorrelated  with  flow  features.  They  conclude  that  the 
increased  productivity  brought  about  by  upwelling  is  a 
function  of  the  rate  of  upwelling,  the  distribution  of  the 
sources,  and  the  mixing  efficiency  of  the  flow. 

We  consider  two  cases,  the  first  with  the  source  con¬ 
fined  to  a  single  patch  (Figure  5)  and  the  second  with 
the  source  distributed  over  64  smaller  patches  but  such 
that  the  total  area  is  the  same  as  in  the  first  case  (Fig¬ 
ure  6).  The  figures  show  the  distribution  of  primary 
production  at  a  time  when  the  biology  is  in  a  statistical 
equilibrium. 
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Figure  5.  Model  3,  fluid  stirring.  The  forcing  of  nutrient 
and  primary  production  resulting  from  stirring  by  a  2D  tur¬ 
bulent  field.  The  production  ranges  from  0.1  (black)  to  0.45 
(white)  nMol  N  m“^  d“*^. 


tion  has  increased  by  36%.  The  fluid  motion  has  the 
effect  of  increasing  the  surface  area  between  the  nutri¬ 
ent  enriched  and  surrounding  waters.  A  striking  feature 
of  the  distribution  of  production  is  that  it  is  limited  to 
the  vicinity  of  the  forcing  (an  inert  tracer  will  be  spread 
quickly  across  the  entire  domain).  The  reaction  limits 
the  extent  of  the  region  of  high  nutrients  and  hence  the 
area  of  contact  with  the  surrounding  waters.  Reduc¬ 
ing  the  reaction  rate  (reducing  (3u)  will  allow  greater 
stirring  to  take  place  before  the  nutrients  are  consumed 
and  hence  increase  the  area  of  contact,  counteracting 
the  decrease  in  overall  production  because  of  the  de¬ 
creased  production  rate.  Experiments  varying  flu  show 
that  the  two  effects  may  balance  each  other. 
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Figure  6.  As  for  Figure  5  but  for  a  distributed  source. 


Referring  to  the  single  source  case  (Figure  5),  the 
nutrient  forcing  has  produced  elevated  rates  of  produc¬ 
tion  above  the  background  level  of  0.1  mMol  N  m"“^ 
d~^.  Compared  to  the  case  with  the  same  forcing  but 
no  fluid  stirring  (u  =  0)  the  areal  average  of  produc- 


As  expected,  distributing  the  source  increases  the 
mixing  and  hence  production  rate  (Figure  6).  In  this 
case  the  overall  production  rate  has  been  increased  by 
137%  above  the  case  with  no  mixing. 
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3.  Concluding  remarks 

We  have  reported  on  a  number  of  investigations  into 
the  impact  of  heterogeneity  of  the  marine  ecological  sys¬ 
tem  on  the  behaviour  of  that  system  as  a  whole.  The 
results  demonstrate  that  the  heterogeneity  cannot  be  ig¬ 
nored.  Applying  the  results  from  studies  such  as  those 
presented  here  is  difficult.  We  freely  admit  that  the 
results  of  our  studies  to  date  are  somewhat  of  an  anec¬ 
dotal  character.  There  are  caveats  on  the  methodology 
such  as  the  exclusion  of  the  diurnal  and  in  particular 
the  seasonal  cycle,  both  of  which  impose  a  strong  pac¬ 
ing  of  the  system,  and  which  need  to  be  included  in 
future  studies. 

It  is  disconcerting  that  the  results  are  very  depen¬ 
dent  on  the  form  of  the  ecological  model  (as  an  addi¬ 
tional  example,  the  enhancement  of  productivity  found 
in  model  3  was  significantly  reduced  when  a  simpler 
[N,P]  was  used).  Perhaps  not  a  surprising  remark  but 
one  which  is  often  ignored.  This  issue  needs  to  be  ex¬ 
plored  further,  but  it  does  make  the  task  of  developing 
frobust’  ecological  models  that  much  more  difficult. 

What  is  clear  is  that  viewing  marine  ecology  as  a 
simple  ID  system  (the  space  dimension  being  vertical) 
may  well  be  erroneous  and  produce  misleading  results. 
Ecological  models  are  often  ‘fitted’  or  rejected  on  the 
basis  of  the  performance  of  a  ID  version  of  the  model 
compared  to  observations.  At  best  the  ‘fitted’  param¬ 
eters  may  be  dependent  on  the  physical  environment, 
implicitly  including  the  effects  of  unresolved  physical 
and  ecological  processes.  At  worst  the  functional  form 
of  the  model  chosen  through  this  comparison  may  be 
wrong.  In  either  case,  in  using  an  ecological  model  to 
predict  changes  to  the  ecological  system  as  the  physical 
environment  changes,  we  need  to  ensure  that  the  model 
captures  the  correct  impact  of  those  changes. 
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Abstract.  Advection-diffusion-reaction  (adr)  models  link  physical  oceanog¬ 
raphy  and  biological  oceanography.  These  models,  which  describe  biology 
using  continuous  concentration  fields,  usually  neglect  individual-scale  fiuctu- 
ations.  I  describe  a  stochastic  individual-based  model,  called  the  Brownian 
bug  process,  which  illustrates  some  of  the  surprising  issues  associated  with 
the  neglect  of  fluctuations  by  ADR  descriptions.  The  Brownian  bug  model 
is  an  ensemble  of  random  walkers  which  suffer  birth  and  death  at  constant 
mean  rates.  (Probabilists  will  recognize  the  Brownian  bug  model  as  the 
simplest  example  of  a  “superprocess.”)  Binary  division  puts  two  bugs  (parent 
and  progeny)  at  the  same  position  and  the  accumulation  of  these  small-scale 
density  fluctuations  can  produce  palpable  nonuniformities  on  large  scales. 

In  other  words,  provided  that  the  diffusion  is  not  too  strong  relative  to  the 
reproduction  rate,  a  spatially  homogeneous  initial  condition  spontaneously 
develops  patches  and  voids.  The  wavenumber  signature  of  these  reproductive 
pair  correlations  is  that  the  spectrum  of  density  fluctuations  remains  white 
but  rises  linearly  with  time.  Diffusion  opposes  this  reproductive  forcing,  most 
effectively  at  large  wavenumbers,  so  that  a  red  spectrum  develops. 


1.  Introduction 

Individuals  are  the  fundamental  unit  of  ecology  but 
it  is  not  feasible  to  model  ecological  processes  by  track¬ 
ing  single  organisms.  Instead,  ecologists  often  em¬ 
ploy  advection-diffusion-reaction  (adr)  approximations 
which  describe  biology  using  the  ‘concentration’  ^(x,  t) 
{e.g.,Flierl  et  al.  1999).  The  concentration  is  defined 
via  a  sample  area^  dA  surrounding  any  x  at  time  t; 

C7(x,  t)  dA  =  expected  number  of  organisms  in  dA. 

(1) 

The  dimensions  of  dA  are  such  that  the  strong  nonuni¬ 
formities  characteristic  of  scales  comparable  to  that  of 
an  individual  are  greatly  reduced  by  averaging. 

Ecological  ADR  models  originated  with  papers  by 
Fisher  (1937),  Kolmogorov  et  al.  (1937)  and  Skellam 
(1951).  These  authors  independently  proposed 

Ct  =  'yC-  AC2  +  (2) 

as  a  model  for  the  growth,  saturation  and  dispersion  of 
a  population.  This  chapter  of  theoretical  ecology  is  ei¬ 
ther  a  slide  down  from  this  summitt,  as  the  adr  model 
is  applied  to  specific  cases,  or  an  ascent  as  the  ADR 

^In  this  note  I  will  work  in  two-dimensions  so  that  in  (1)  I  am 
using  a  control  area  rather  than  a  control  volume. 


approximation  is  derived  and  its  limitations  are  better 
imderstood.  The  1997  compilation  edited  by  Tilman 
and  Kareiva  gives  a  good  overview  of  both  these  em 
deavours.  In  this  review  we  will  admire  some  scenery 
on  the  climb-up  to  the  ADR  approximation  by  describ¬ 
ing  a  maximally  simplified  model —  the  Brownian  bug 
process. 


Ocean  ecology  is  a  natural  setting  for  ADR  descrip¬ 
tions  because  the  spatial  domain  is  large  and  the  pop¬ 
ulations  of  some  species  are  enormous.  Population  size 
matters  because  C  in  (1)  is  an  average  or  expectation. 
The  actual  number  of  organisms  in  d.4,  say  iV,  is  a  ran¬ 
dom  variable  and  using  the  definition  in  (1)  iV  =  CdA. 
The  difference  N'  =  iV  —  iV  is  a  fluctuation.  Deter¬ 
ministic  ADR  equations  ignore  these  fluctuations.  The 
naive  justification  for  discarding  fluctuations  is  that  N 
is  a  Poisson  random  variable  (e.g.,  Durrett  and  Levin 
1994)  so  that 


N' 


N 


(3) 


Thus  a  deterministic  description  like  (2)  may  be  useful 
for  copepods,  but  not  for  whales. 

In  oceanography,  ADR  models  are  a  main  line  of  com¬ 
munication  between  physicists  and  biologists.  Physical 
oceanographers  put  the  A  into  the  ADR  equation  (2)  by 
adding  the  term  u*  VC  to  the  left  hand  side.  The  ve- 
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Figure  1.  A  Brownian  bug  simulation  using  A  =  0.001, 
p  =  q  =  1/4,  A^o  =  4000,  and  U  =  0  (no  advection).  The 
position  of  each  bug  is  plotted  as  a  point  in  the  unit  square. 
The  top  panel  shows  the  initial  condition,  the  middle  panel 
is  after  10  cycles  and  the  bottom  after  100  cycles. 


locity  u(x,  t)  is  supplied  by  some  physical  model.  (For 
example,  see  the  paper  by  Kelvin  Richards  in  these  pro¬ 
ceedings.)  Physicists  find  this  type  of  model  appealing 
because  the  mathematical  structure  is  familiar,  and  be¬ 
cause  it  is  easy  to  plug  biology  into  existing  ocean  mod¬ 
els.  This  is  also  my  excuse  for  discussing  these  issues  in 
a  meeting  devoted  to  oceanographic  stirring  and  mix¬ 
ing. 

When  I  described  the  Brownian  bug  process  in  my 
presentation  at  the  2001  ^Aha  Huliko^a  meeting,  Mel 
Briscoe  pointed  out  that  the  same  model  had  been  in¬ 
dependently  presented  at  the  1997  'Aha  Huliko'a  meet- 
ing  by  Robert  Adler  and  identified  as  a  superprocess. 
Adler  approaches  this  subject  from  the  perspective  of 


a  probabilist  and  the  references  in  his  article  provide 
an  entry  into  the  extensive  and  very  mathematical  lit¬ 
erature  on  superprocesses.  A  Google  search  on  “super¬ 
process”  gives  about  250  hits,  most  of  which  are  on  the 
homepages  of  mathematicians.  Table  stakes  in  the  su¬ 
perprocess  community  are  an  imderstanding  of  measure 
theory  and  branching  processes — most  oceanographers 
can’t  ante. 

Another  closely  related  class  of  problems  is  diffusion- 
limited  reactions,  e.g.,  see  the  review  by  Mattis  and 
Glasser  (1998),  which  emphasizes  the  utility  of  quan¬ 
tum  field  theory  in  these  problems.  There  seems  to 
be  little  communication  between  physicists  interested 
in  diffusion-limited  reactions  and  probabilists  studying 
superprocesses. 

Ecologists  and  oceanographers  will  not  find  it  easy  to 
understand  the  review  articles  written  by  either  of  these 
communities.  This  review,  which  is  intended  for  my 
oceanographic  colleagues,  might  then  be  titled  “Super¬ 
processes  for  pedestrians,  with  no  quantum  field  theory 
and  no  measure  theory.” 

2.  Brownian  bug  simulations 

The  Brownian  bug  process  is  an  ensemble  of  random 
walkers  (idealized  as  points)  milling  around  in  continu¬ 
ous  time  and  space  and  simultaneously  reproducing  and 
dying  at  constant  mean  rates.  Reproduction  is  random 
in  time  and  occurs  via  bineiry  division,  so  that  parent 
and  progeny  are  at  the  same  point  for  just  an  instant 
before  their  independent  random  walks  start  to  produce 
y/Dt  separation.  Death  is  also  random  and  dead  bugs 
are  removed  from  the  ensemble.  The  addition  of  advec¬ 
tion  to  this  scenario  ( Young  2001)  results  in  the  simplest 
model  of  a  planktonic  species  reproducing  and  dying  in 
a  turbulent  fluid. 

To  simulate  the  continuous  process  described  above 
on  a  computer  one  must  discretize.  There  are  sev¬ 
eral  ways  of  doing  this,  and  I  describe  in  the  appendix 
an  individual-based  Monte  Carlo  procedure  which  pro¬ 
duces  Figure  1.  The  top  psinel  shows  the  homogeneous 
initial  condition  and  the  middle  and  bottom  panels  the 
results  after  10  and  100  cycles  of  birth,  death  and  dif¬ 
fusion.  Patches  of  bugs  spontaneously  form  out  of  the 
spatially  uniform  initial  conditions. 

Failure  of  deterministic  ADR.  The  ADR  ap¬ 
proximation  of  the  simulation  in  Figure  1  is  simply 

Ct  =  DV^C+{\-p)C,  (4) 

where  D  is  the  diffusivity,  A  is  birth  rate,  and  p.  the 
death  rate.  (For  the  relation  between  D,  A,  and  p  and 
the  parameters  in  the  simulation  see  the  appendix.)  But 
in  Figure  1  birth  and  death  are  equiprobable,  X  =  p, 
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and  consequently  (4)  collapses  to  the  diffusion  equa¬ 
tion.  Thus,  with  the  initial  condition  ^(x,  0)  =  Nq/L^, 
the  solution  of  the  ADR  description  is  ^(x,  t)  =  Nq/L^. 
Uniform  concentration  is  not  a  good  characterization  of 
Figure  1  and  we  conclude  that  the  adr  approximation 
fails  to  describe  the  Brownian  bug  process. 

A  stochastic  ADR  equation.  Although  deter- 
minstic  ADR  fails,  there  is  a  stochastic  partial  differen¬ 
tial  equation  model  which  does  capture  the  fluctuations 
exhibited  by  the  individual-based  model.  This  stochas¬ 
tic  equation  is  something  like 

Ct=DV^C-\^VCw{y:,t)  (5) 

where  w  is  a.  white  Gaussian  noise.  The  y/C  in  (5) 
occurs  because  in  the  control  volume  d  the  stochastic 
imbalance  between  births  and  deaths  is  proportional  to 
y/C.  I  qualified  the  introduction  of  (5)  with  “something 
like”  because  mathematicians  (e.g.,  Adler  1997)  tell  us 
that  (5)  makes  sense  only  in  one  dimension,  d  =  1,  but 
not  when  d  >  2:  in  that  case  the  concentration  C(x,  i) 
cannot  be  defined.  I  do  not  fully  imderstand  this  point. 
In  any  event,  Adler  proceeds  to  introduce  an  integrated 
version  of  (5)  which  he  claims  is  sensible  even  in  d  >  2. 
We  press  on... 

Density  fluctuations.  The  spatial  clumping  in 
Figure  1  occurs  because  birth  is  always  next  to  a  living 
bug,  while  death  can  occur  anywhere.  Alternatively,  in 
some  regions  because  of  bad  luck  there  is  local  extinc¬ 
tion  of  the  bug  population.  Diffusion  from  the  surviving 
centers  of  population  is  not  fast  enough  to  recolonize 
the  voids.  This  suggests  that  to  produce  patches  the 
diffusion  D  must  be  weak  relative  to  the  growth  rate 
A  =  /X.  We  can  identify  the  nondimensional  parameter 
that  controls  this  transition  by  noting  that  the  initial 
separation  distance  between  bugs  (the  “specific”  length) 
is£s  =  L/ y/No.  On  the  other  hand,  using  the  diffusivity 
D  and  the  growth  rate  A,  we  can  define  a  “reproduc¬ 
tive”  length  £r  =  ^/D/X,  which  measures  the  separa¬ 
tion  that  develops  between  parent  and  progeny  before 
there  is  another  reproductive  event.  We  expect  that 
patchiness  will  be  strong  if  the  nondimensional  number 


Figure  2.  Results  of  7  simulations  each  with  No  =  2^® 
bugs  at  t  =  0.  The  unit  square  is  divided  into  a  lattice 
of  32  X  32  boxes  (i.e.,  p  =  5)  and  defined  in  (10)  is 

computed.  The  parameters  in  section  5  are  p  =  g  =  1/4 
and  the  step-length  of  the  random  walk,  A,  is  varied  as 
indicated.  When  the  diffusion  is  strong  <7  remains  close 
to -its  initial  Poisson  value,  <7(5,0)  w  1/16,  The  dotted 
curve  is  <7gw  ^  y/{l  +  2fjLt)fnp,  which  is  computed  using 
the  Galton- Watson  solution  in  (18)  and  (19)  below. 


There  are  2^^  boxes  so  that  the  expected  number  of 
bugs  per  box  is 

Up  =  2~^pNq  .  (8) 

The  occupation  number  of  box  b  is  the  fluctuating  quan¬ 
tity 


nb{t)  =  the  number  of  bugs  in  box  b  at  t,  (9) 

where  b  =  1, . . . ,  2^^  indexes  the  boxes.  To  measure  the 
size  of  fluctuations  in  nt  relative  to  the  expected  value 
Up  we  use  the  ratio  of  the  rms  value  of  n{b,t)  to  np-. 


cr(p,t) 


2^P 

\  t)  -  np]'^ /np . 

\6=1 


(10) 


is  large.  If  the  ratio  is/^r  is  small  then  density  flue- 
tuations  should  be  small  because  the  diffusion  will  be 
effective  at  mixing  a  family  into  the  background  of  im- 
related,  and  therefore  uncorrelated,  bugs. 

To  illustrate  these  considerations,  the  L  x  L  square  is 
divided,  like  a  chessboard,  into  smaller  squares  (“boxes”), 
each  of  size 

4  =  2-PL. 


If  o’(p,  t)  I  then  n(6,  t)  is  fluctuating  weakly  aroimd 
the  expected  value  rip. 

Figure  2  shows  <t(p,  t)  for  a  set  of  simulations  with 
various  difFusivities  and  p  =  5.  The  a(p,  0)  is  small  be¬ 
cause  the  box  size  £p  =  1/32  is  much  greater  than  the 
specific  length,  £5  =  1/512.  If  the  diflFusivity  is  strong 
enough  (for  example,  A  >  2“®  in  Figure  2)  then  aip.t) 
experiences  mild  growth  but  cr(p,  00)  <C  1.  In  this  case 
the  diffusivity  is  arresting  reproductively  driven  fluctu¬ 
ations.  On  the  other  hand,  if  the  diffusion  is  small  then 
the  growth  of  a  proceeds  till  C7(p,  00)  >  0(1),  which 


(7) 
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indicates  large  fluctuations  in  the  density  defined  using 
the  box-scale  £p.  In  particular,  if  there  is  no  diffusion 
(A  =  0)  then  one  can  show  using  the  Galton- Watson 
solution  in  (18)  and  (19)  that  cr(p,t)  a  \/(l  +  2fjit)/np. 
Notice  that  by  decreasing  the  diffusivity  we  are  increas¬ 
ing  the  nondimensional  parameter  in  (6). 

It  would  be  useful  to  be  able  to  give  a  scaling  law  for 
the  saturated  value,  cr{p,  oo),  in  terms  of  D,  fj,,  Ip  and 
the  other  external  parameters.  I  have  not  been  able  to 
achieve  this. 

3.  Galton- Watsonology 

The  Brownian  bug  model  combines  two  classical 
stochastic  processes:  Brownian  motion  and  a  Galton- 
Watson  branching  process.  Brownian  motion  is  familiar 
to  most  oceanographers,  but  the  Galton- Watson  pro¬ 
cess  is  not.  This  section  summarizes  some  basic  prop¬ 
erties  of  Galton- Watson  process. 

Suppose  that  we  switch  off  the  motion  of  the  bugs, 
i.e.,  we  set  the  step  length  of  the  random  walk  to  zero. 
In  this  case  each  bug  founds  a  family  which  sits  at  the 
initial  spatial  location.  Newly  born  bugs  pile  up  and 
build  a  tower  on  top  of  their  ancestors.  The  towers  go 
up  and  down  as  bugs  are  born  and  die.  Towers  can  dis¬ 
appear  forever  (extinction)  if  a  run  of  bad  luck  wipes 
out  the  family.  The  statistics  of  these  families,  particu¬ 
larly  the  probability  of  extinction,  were  studied  by  Gal¬ 
ton  and  Watson  in  the  nineteenth  century  (see  Harris, 
1963,  and  chapter  XVTI  of  Feller,  1968).  In  this  review 
we  closely  follow  Feller’s  continuous-time  formulation  of 
the  Galton- Watson  process. 

The  fundamantal  variables  of  the  Galton-Watson 
process  are  the  probabilities 

Pk{t)  =  the  probability  that  there  are  k  bugs  at  t. 

We  study  the  proliferation  of  a  family  descended  from 
a  single  urbug  using  the  initial  condition  pi(0)  =  1  and 
P*5(0)  =  0  if  7^  1.  The  probability  that  a  family  is 
extinct  at  t  is  po(t)-  Given  Pk{t),  one  calculates  the 
average  of  a  function  of  k  according  to  the  definition 

OO 

fik,t)  =  '^f{k,t)pkit).  (11) 

fc=0 

Normalization  of  probabilities  requires  1  =  1  and  the 
mean  population  of  a  family  is  k{t). 

The  family  becomes  extinct  when  a  family  with  one 
surviving  member  suffers  a  death;  accordingly  the  prob¬ 
ability  of  extinction  po  increases  like 

(12) 


The  evolution  of  the  other  probabilities  with  A;  >  1  is 
obtained  from  the  hierarchy  of  differential  equations 

Pk  =  A(fc  -  l)pfc_i  -  (A  -F  p)kpk  +  p{k  -I-  l)pfc+i.  (13) 

The  term  -(A  -I-  fj,)kpk  on  the  right  hand  side  of  (13) 
appears  because  a  family  of  k  bugs  might  make  a  tran¬ 
sition  to  fc  -I-  1  bugs  if  any  of  its  k  members  repro¬ 
duce  (probability  per  time  Xk)  or  make  a  transition  to 
fc  —  1  bugs  if  any  of  the  fc  members  die  (probability  per 
time  pfc).  Analogous  reasoning  is  used  to  deduce  the 
other  terms  containing  fc  -  1  and  fc  +  1  on  the  right 
hand  side  of  (12). 

Mean  and  variance.  From  (12)  and  (13)  one  can 
show  using  the  definition  (11)  that  I  =  1  and 

dfc 

^  =  (A-M)fc.  (14) 

This  result  establishes  the  connection  between  the  Gal¬ 
ton-Watson  process  and  the  ADR  equation  (3):  both 
fc  and  C  are  expectations  defined  by  ensemble  averages 
and  A  -  p  is  the  growth  rate  of  the  expected  population. 
In  fact,  if  we  start  with  Nq  bugs,  each  of  which  is  a 
family  with  one  member,  then  because  the  evolution  is 
independent  f  CdA  =  Nok. 

The  statistic  fc  contains  no  information  concerning 
the  fluctuations  in  population.  That  is,  at  times  f  >  0 
some  families  are  extinct,  while  others  are  very  large.  A 
coarse  description  of  these  fluctuations  is  provided  by 
the  variance  k^  -  P,  or  better  by  the  ratio 

v=  \/k^-  P/fc ,  (15) 

which,  like  (3),  compares  the  fluctuations  to  the  mean. 
Assiduous  summation  of  (12)  and  (13)  shows  that 

dP 

-^  =  2(A  -  p)fc2 -f  (A -h  p)fc .  (16) 

If  the  death  rate  balances  the  birth  rate,  so  that  A  =  p 
and  fc  =  1,  then  the  solution  of  (16)  is 

k^  =  l  +  2fit,  (A  =  p).  (17) 

In  this  case  v  =  y/TpI,  indicating  that  the  system  is 
dominated  by  fluctuations  as  t  — >  oo. 

The  special  case  A  =  p  in  (17)  is  the  critical  point 
of  the  branching  process.  If7  =  A-  p>0  then 
fc  ~  exp(7t)  00  and  i;  ~  exp(-7t/2)  ->  0.  On 
the  other  hand,  if  7  <  0  then  v  ~  exp(-7f/2)  00. 
The  conclusion  is  that  the  system  is  dominated  by  fluc¬ 
tuations  if  the  growth  rate  7  is  less  than  or  equal  to 
zero. 


Po  =  fipi . 
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A  solution  with  X  =  fi.  The  variance  is  only  a 
rough  guide  to  behaviour  of  fluctuations.  A  much  more 
informative  result  is  provided  by  an  explicit  solution  of 
(12)  and  (13)  which  is  obtained  using  the  generating 
function  method  {Feller  1968): 


Po{t)  = 


fxt 

1  +  /it  ’ 


(18) 


and  if  A:  >  1: 


Pkit)  = 


(1  +/if)'=+i  ' 


(19) 


At  large  times,  /it  »  1,  see  that  most  families  are  ex¬ 
tinct,  po{t)  ->■  1.  That  is,  at  large  times  the  most  prob¬ 
able  number  of  members  in  any  given  family  is  zero  — 
in  fact  extinction  is  certain.  On  the  other  hand  k  =  l 
so  that  the  expected  number  of  members  in  a  family 
(counting  extinct  famihes  as  having  zero  members)  is 
always  constant. 

To  reconcile  the  extinction  of  most  families  with  a 
constant  expected  population,  we  make  a  distinction 
between  surviving  families  (meaning  a  family  with  at 
least  one  member)  and  extinct  families.  Suppose  that 
we  start  with  a  large  number  ATo  »  1  of  individuals, 
each  of  which  founds  a  family.  At  t  the  number  of  sur- 
viving  families  is  a  random  variable,  F{i).  The  expected 
number  of  surviving  families  is  F  =  (1  -po)No,  or  using 


F(t)  = 


No 

1  +  /xi 


(20) 


Let  the  random  variable  n  be  the  number  of  individuals 
in  a  surviving  family.  Prom  (18)  and  (19),  the  proba¬ 
bility  distribution  of  n,  conditioned  on  the  survival  of  a 
family,  is 


(/xt)””^ 

-  (1  +  nt)n  >  («  =  1,  2  •  •  ■) .  (21) 

and  consequently 

n(f)  =  pt+l,  n2  =  (2/tt  +  1)  (/it  -fl) .  (22) 

Thus,  the  number  of  surviving  families  decreases  like 
t  while  the  membership  of  a  surviving  family  grows 
like  to  keep  the  total  population  fixed. 

These  remarks  illustrate  Galton  and  Watson’s  con¬ 
clusion  that  the  “observed  diminution  of  siumaxnes  among 
families  whose  history  we  can  trace,  is  not  a  sign  of  their 
dimimshed  fertility.”  Instead  the  extinction  of  most 
families,  and  the  apparent  fecundity  of  a  few  exceptions, 
is  a  result  of  “the  ordinary  law  of  chances.” 


4.  Spectred  dynamics 

We  began  section  3  by  imagining  that  the  bugs  were 
immobile  so  that  the  surviving  families  formed  towers 
on  top  of  the  urbug.  We  then  ignored  the  spatial  aspects 
of  the  model,  and  focussed  simply  on  the  statistics  of  the 
bug  population.  Now  let  us  return  to  the  spatial  domain 
and  study  the  spectrum  of  concentration  fluctuations. 

For  the  moment,  we  continue  to  assume  that  the  bugs 
are  immobile  {D  =  0).  The  discussion  surrounding 
(20)  through  (22)  shows  that  the  height  of  a  typical 
family  tower  grows  like  1  H-  pt  and  the  average  sep¬ 
aration  between  towers  grows  hke  (s/VT+Jd,  where 
is  =  Lj y/Nl.  We  can  write  the  microscopic  density  of 
bugs  in  a  single  realization  of  the  process  as 

F 

p{x,t)  =  Y,nf5ix-jCf),  (23) 

/=i 


where  F{t)  is  the  number  of  surviving  families,  n/  is  the 
number  of  members  in  family  f  and  Xf  is  the  location 
of  the  family.  Notice  that  the  concentration  is 

C={p}=  Nq/L^  ,  (24) 

where  the  ()  denotes  an  ensemble  average  over  the  ran¬ 
dom  locations  ^  Xf  and  an  average  over  the  n^’s  using 
(21).  The  average  C  =  (p)  is  trivial  —  just  like  k  in  the 
previous  section.  To  learn  something  about  the  fluctu¬ 
ations  we  must  study  quadratic  quantities  such  as  the 
Spectrum. 

Continuing  with  the  £)  =  0  assumption,  we  now 
make  a  standard  calculation  of  the  spectrum  by  ensem- 
ble  averaging  the  density  in  (23).  The  Fourier  series 
representation  of  (23)  is 

/>(x,  t)  =  T-2  pk  exp  (ik-  x)  ,  (25) 

k 

where 

F 

Pk  =  ^  n/  exp  i-ik-Xf)  .  (26) 

/=i 

The  spectrum  is  related  to 

F 

(PkPk)  =  (  E 2 5^ E ,  (27) 
/=i  /'>/  ' 

where  the  ()  is  an  ensemble  average  over  both  the  ran¬ 
dom  and  uncorrelated  positions  x/  of  the  surviving  fam¬ 
ilies  and  over  the  random  variable  n/.  Because  there  are 


^The  family  locations  xy  are  uniformly  distributed  in  the  do¬ 
main.  Thus  ((5(x-xy))  =  1,-2  and  if  k  5^  0  then  {exp(ik-xy))  = 


128 


YOUNG 


Figure  3.  Comparison  of  the  analytic  prediction  (31)  (the 
smooth  solid  curve)  with  the  results  of  simulation.  We  show 
estimated  spectra  (the  three  dotted  curves)  from  the  simula¬ 
tion  att  =  l,  f  =  10  and  t  =  100;  the  comparison  with  (31) 
is  made  only  at  t  =  100.  The  simulation  used  No  =  10®  bugs 
with  A  =  2  and  p  =  q  =  7/64.  The  spectra  are  estiamted 
by  azimuthally  averaging  in  the  wavenumber  plane. 

no  correlations  between  the  positions  of  the  families  the 
final  term,  involving  the  double  sum  with  f  ^  f*  van¬ 
ishes.  Using  F  from  (20),  and  from  (22),  shows  that 
the  remaining  term  on  the  right  hand  side  of  (27)  is 

(A>kPk)  =  Fn2  =  JVo(l  +  2pLt) .  (28) 

This  simple  calculation  indicates  that  the  spatial  sig¬ 
nal  of  a  diffusionless  Galton- Watson  process  is  a  white 
spectrum  which  rises  at  a  constant  rate. 

How  is  the  conclusion  above  affected  by  diffusion? 
With  D  ^  0  the  members  of  a  family  move  as  indepen¬ 
dent  random  walkers  and  the  family  towers  blur  into 
spreading  clouds  of  bugs.  We  could  write  down  a  micro¬ 
scopic  density,  analogous  to  (23),  in  which  the  position 
of  each  individual  bug  is  represented  with  a  ^-function. 
But  then  the  calculation  foimders  at  (27)  because  the  () 
no  longer  kills  the  off-diagonal  terms:  there  are  correla¬ 
tions  between  the  positions  of  bugs  which  belong  to  the 
same  family.  Instead  we  can  argue  heuristically  that 
the  evolution  of  the  spectrum,  defined  as 

S{k^t)  =  ATq  ^(pkPk)j  (29) 

is  given  by 

St  =  2/i  -  2Dk^S ,  S(fe,  0)  =  1 ,  (30) 

where  k  =  Vk  k.  The  first  term  on  the  right  hand 
side  of  (30)  is  a  source  of  variance  which  produces  the 
uniform  rise  of  the  spectral  level  in  (28).  The  the  final 


term  is  the  diffusive  sink  of  variance  which  ultimately 
balances  the  source. 

The  solution  of  (30) 

exhibits  a  range  at  high  wavenumbers.  (If  we 
isotropize  by  using  27rA:S(A:)  then  this  is  a  spec¬ 
trum.)  Figure  3  shows  good  comparison  of  (31)  with  a 
spectrum  estimated  from  a  simulation, 

5.  Conclusions 

In  this  note  we  have  studied  a  model  which  focuses 
attention  on  the  role  of  fluctuations  in  the  spatial  de¬ 
velopment  of  populations.  It  is  remarkable  that  Euler, 
Navier  and  Stokes  formulated  the  basic  equations  of 
fluid  mechanics  before  the  reality  of  atoms  was  estab¬ 
lished.  This  history  indicates  that  for  most  purposes 
atomic  fluctuations  are  unimportant  in  fluid  mechanics. 
The  Brownian  bug  model  suggests  that  the  problem  of 
biological  fluctuations  is  more  severe,  and  that  events 
on  microscopic  scales  can  impact  macroscopic  observa¬ 
tions,  Indeed,  the  source  term  2//  on  the  right  hand 
side  of  (30)  is  due  to  reproduction  and  has  no  analog  in 
fluid  mechanics. 

Appendix:  A  recipe  for  simulations 

The  initial  condition,  t  =  0,  is  prepared  with  a  large 
number  iVo  of  bugs,  strewn  randomly  into  an  L  x  L 
square  domain.  The  initial  condition  is  a  Poisson  point 
process  in  which  there  are  no  correlations  between  the 
positions  of  the  bugs. 

The  simulation  is  advanced  through  time  in  incre¬ 
ments  of  a  “cycle”  of  duration  r.  Each  cycle  consists 
of  two  steps:  (a)  random  birth  and  death;  (b)  Brow¬ 
nian  motion.  In  step  (a)  each  bug  reproduces  by  bi¬ 
nary  fission  (probability  p)  or  dies  (probability  q)  or 
remains  imchanged  (probability  1  —  p  —  q).  When  a 
lucky  bug  divides  the  offspring  is  placed  on  top  of  the 
parent.  In  step  (b),  bug  k  is  displaced  to  a  new  position 
-h  r)  =  Xfc(t)  H-  (5xfc(t).  The  components  of  (Jx^  are 
independent  and  identically  distributed  Gaussian  ran¬ 
dom  variables,  each  with  rms  value  A  (i.e.,  Brownian 
motion  with  the  diffusivity  D  =  A^/2r).  These  inde¬ 
pendent  displacements  separate  the  coincident  parent- 
progeny  pairs  created  by  step  (a). 

In  principle,  one  can  approach  the  continuous  limit 
by  taking  r  — >  0  while  holding  the  parameters  A  =  p/r, 
p  =  q/r^  and  D  =  A^/2t  fixed. 

We  can  also  use  po(^)  in  (18)  to  calculate  the  prob¬ 
ability  that  an  initial  population  of  Nq  bugs  becomes 
extinct  at  time  t\  each  bug  produces  a  family  which 
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evolves  independently  of  the  other  families,  so  that  the 
probability  that  all  Nq  families  are  extinct  at  t  is 

Pext  =  Po°  «  exp(-rext/t)  ,  (Al) 

where  the  extinction  time-scale  is  Text  =  No/fx.  (The 
exponential  approximation  in  (Al)  assumes  that  Nq  is 
large  and  tfT^xt  is  of  order  umty.)  In  simulations  we 
avoid  extinction  by  making  Nq  much  larger  than  the 
number  of  generations.  The  suppression  of  extinction 
is  a  consequence  of  approaching  the  “thermodynamic 
limit”  in  which  Nq  ^  oo  with  Nq/L^  fixed. 
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Abstract.  Univariate  and  bivariate  procedures  for  investigating  the  properties  of 
single  and  joint  intermittent  stochastic  processes  are  presented.  They  allow  the 
characterization  of  all  the  statistics  of  intermittent  variables  using  a  set  of  three 
basic  parameters  in  the  multiffactal  framework,  whatever  the  scales  and  the  in¬ 
tensity.  The  multifractal  formalism  is  then  extended  to  more  than  one  variable 
to  investigate  the  degree  of  dependence  among  random  fields  by  examining  the 
nature  of  their  joint  distribution.  These  formalisms,  which  do  not  require  any 
assumption  about  the  spectrum  or  the  probability  distribution  of  the  data  sets 
under  consideration,  are  finally  illustrated,  first,  by  studying  the  properties  of 
turbulent  velocity  fluctuations  recorded  in  a  laboratory  grid-generated  turbu¬ 
lence  experiment  and  in  the  ocean  using  a  high-frequency  free-fall  profiler.  Sec¬ 
ond,  we  investigate  the  distributions  of  temperature,  salinity  and  in  vivo  fluores¬ 
cence  (a  proxy  of  ph5doplankton  biomass)  time  series  recorded  in  the  coastal 
waters  of  the  Southern  Bight  of  the  North  Sea,  as  well  as  their  potential  correla¬ 
tion. 


Introduction 

Considering  the  current  awareness  of  the  intermittent 
nature  of  both  physical  and  biological  patterns  and  proc¬ 
esses  in  marine  sciences  (Gibson,  1991;  Pascual  et  al., 
1995;  Seuront  et  al,  1996a,  1999;  Jou,  1997;  Jimenez, 
2000),  and  the  emergence  of  hot  topics  such  as  those  re¬ 
lated  to  thin-layer  properties  (Cowles  et  al,  1998;  Osborn, 
1998),  there  is  a  real  need  to  focus  on  the  precise  nature  of 
both  single  intermittent  variables  and  the  degree  of  correla¬ 
tions  between  joint  intermittent  variables,  especially  those 
related  to  physics  and  biology. 

Recent  theoretical  and  empirical  developments,  con¬ 
ducted  both  on  turbulent  energy  dissipation  rates  and  on 
the  associated  passive  scalars  distributions,  have  demon¬ 
strated  the  multiscale  properties  of  intermittent  processes 
(Pascual  et  al,  1995;  Seuront  et  al,  1996a,  b,  1999, 
2001a,  b;  Lovejoy  et  al,  2001).  The  large  numbers  of 
modeling  approaches  devoted  to  the  structure  of  turbulent 
flows  have  only  provided  description  at  a  very  limited 
range  of  scales.  Indeed,  neither  Gaussian  diffusion  models 
(Visser,  1997)  nor  direct  numerical  simulations  (Yamazaki 
et  al,  1991)  possess  a  way  to  change  the  scale  upward  or 


downward.  In  contrast,  scale  change  is  natural  for  multi- 
fractal  processes  (Seuront  et  al,  1999).  Moreover,  Gaus¬ 
sian  processes  are  determined  by  only  two  moments,  while 
the  first  three  moments  of  empirical  processes  are  usually 
provided  to  characterize  them.  In  the  statistical  framework 
of  multifractals,  we  propose  a  more  global  approach  since 
we  provide  all  the  moments  (even  non-integers),  giving  as 
much  statistical  information  as  the  study  of  probability 
density  directly. 

The  procedures  used  to  test  for  independence  between 
two  processes,  based  on  second  order  statistics  analysis 
(i.e.,  correlation),  become  fallacious  for  intermittent  non- 
Gaussian  fields.  It  is  then  necessary  to  test  for  independ¬ 
ence  by  studying  joint  moments  of  all  orders.  In  addition, 
more  recent  procedures  based  on  examination  of  probabil¬ 
ity  density  functions  do  not  deal  with  the  multiscaling 
properties  of  intermittent  fields. 

In  this  paper,  first  we  define  the  concept  of  intermit- 
tency  and  then  review  briefly  the  multifractal  framework. 
Second,  we  will  introduce  an  innovative  objective  tech¬ 
nique  for  determining  if  two  stochastic  processes  can  be 
regarded  as  being  independent  or  not,  and  for  investigating 
the  nature  of  their  potential  coupling.  Finally,  we  provide 
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some  illustrative  case  studies  demonstrating  how  these 
multi  fractal  formalisms  can  improve  our  understanding  of 
the  structures  and  functions  in  marine  systems. 


Intermittency 

The  concept  of  intermittency  has  been  initially  de¬ 
scribed  in  the  framework  of  turbulence,  where  experimen¬ 
tal  records  of  the  kinetic  energy  dissipation  rate  £  showed 
strong  and  unpredictable  bursts  instead  of  a  steady  behav¬ 
ior  of  the  fluctuations  (Fig.  1).  It  has  thus  been  known  for  a 
long  time  that  the  assumption  of  a  spatially  uniform  turbu¬ 
lent  cascade  used  by  Kolmogorov  (1941)  is  not  satisfied 
for  the  increments  of  turbulent  velocity  at  distances  in  the 
inertial  subrange  or  shorter,  and  that  strong  gradients  are 
more  common  than  they  would  be  for  a  Gaussian  distribu¬ 
tion  {Batchelor  and  Townsend^  1949).  This  means  that 
most  of  the  time  the  gradients  would  still  be  of  the  order  of 
their  standard  deviation,  but  that  occasionally  we  should 
expect  stronger  burst,  much  more  often  than  in  the  Gaus¬ 
sian  case.  This  is  the  phenomenon  of  intermittency,  which, 
as  with  other  turbulent  phenomena,  is  found  at  all  scales. 

This  may  be  understood  from  a  geometrical  point  of 
view  considering  that  the  turbulent  regions  do  not  fill  the 
whole  space,  but  only  a  subpart,  in  a  very  irregular  way. 
This  erroneous  picture  of  a  non-space  filling  turbulence 
takes  the  intermittent  nature  of  turbulence  into  account  by 
assuming  that  subeddies  in  the  inertial  subrange  are  either 
‘dead’  (inactive)  or  ‘alive’  (active),  and  leads  to  a  (mono-) 
fractal  description  of  turbulence.  However,  it  is  now 
known  that  both  turbulent  velocity  and  passive  scalars 
fluctuations  are  intermittent  in  the  sense  that  strong  (i.e. 
‘more  active’)  subeddies  occupy  tiny  fractions  of  the  space 
available.  This  led  to  a  multifractal  description  of  turbu¬ 
lence,  each  intermittency  level  being  associated  with  its 
own  fractal  dimension  [see  Frisch  (1995)  for  more  details]. 

In  the  following  we  quickly  review  how  to  characterize 
the  statistics  of  a  multiffactal  intermittent  field. 

Investigating  single  intermittent  distributions: 
a  brief  review  of  the  multifractal  framework 

Spectral  analysis  has  been  widely  used  in  ecology  to 
separate  and  measure  the  amount  of  variability  occurring 
at  different  wavenumbers.  In  particular,  assuming  local 
isotropy  and  three-dimensional  homogeneity  of  turbulence 
in  the  inertial  subrange,  the  velocity  fluctuations  and  the 
fluctuations  of  a  passive  scalar  quantity  P  can  be  de¬ 
scribed  in  Fourier  space  by  the  spectral  densities  Ey  {k) 
and  Ep{k) : 


Ey(k)ock-^‘ 


(1) 


Ep(k)ock-^^  (2) 

where  /:  is  a  wavenumber,  and  with  py  =  =5/3  for  a 

non-intermittent  turbulence  {Kolmogorov^  1941;  Obukhov^ 
1941,1949;  Corrsin,  1951).  However,  spectral  analysis  and 
related  techniques  are  implicitly  based  on  Gaussian  hy¬ 
pothesis  untenable  in  an  intermittent  framework  {Seuront 
et  ai,  1999,  2001a).  Thus,  while  random  variability  has 
been  often  modeled  in  marine  ecology  in  the  Gaussian 
framework,  through,  e.g.,  Gaussian  distributions  and 
Brownian  motion  (e.g.,  Yamazaki  and  Okubo,  1995;  Vis- 
ser,  1997),  we  will  here  generalize  this  approach  fully  tak¬ 
ing  into  account  the  intermittency  of  turbulent  velocity  and 
passive  scalar  fluctuations. 

Under  fairly  general  conditions,  the  probability  distribu¬ 
tion  of  a  random  variable  X  is  equivalently  specified  by 
its  statistical  moments.  Here  we  consider  the  moments  of 
the  increments  of  a  random  field  at  all  scales;  this  corre¬ 
sponds  to  the  introduction  of  the  structure  functions  expo¬ 
nents  l^y  {q)  and  t^p{q) ,  which  describe  the  multiscaling 
of  the  statistical  moments  of  order  q  of  turbulent  velocity 
and  passive  scalar  fluctuations: 


(4) 

where  /  is  the  spatial  scale  and  the  angle  brackets  “  Q  ” 

indicate  ensemble  averaging.  For  monoscaling  (i.e.,  mon¬ 
ofractal)  processes,  the  function  ^{q)  is  linear: 

C(q)  =  ql2  for  Brownian  motion,  and  ^(q)  =  q/3  for 

Kolmogorov-Obukhov-1941.  For  multiscaling  processes, 
this  exponent  is  non-linear  and  concave. 

While  the  functions  ^y{q)  and  ^^(q)  could  depend  on  a 

ver>'  large  number  of  parameters  (a  priori  an  infinite  num¬ 
ber  of  both  integer  and  non-integer  values  of  q ),  in  the 

framework  of  universal  multifractals  they  can  be  deter¬ 
mined  by  only  three  parameters,  with  the  general  form 
^{q)=  Aq  +  Bq“  (Schertzer  and  Lovejoy,  1987).  For  turbu¬ 
lent  velocity,  the  normalization  conditions  corresponding 
to  the  conservation  of  turbulent  kinetic  energy  within  the 
inertial  subrange  give  ^^(3)  =  1 ,  so  that  the  resulting  func¬ 
tion  can  be  written  as 


a,  "X 

3 

a,-l 

1  3 

y 

(5) 


For  a  passive  scalar  P ,  there  is  no  such  a  relation,  and  we 
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must  keep  three  parameters,  writing  the  expression  as 
{Seurontetal,  1996a,  b,  1999,2001a) 

=  (6) 
Wp  i 

H  is  the  degree  of  non-conservation  of  the  average  proc¬ 
ess;  //  =  0  for  a  conservative  process  (i.e.,  scale- 
independent)  and  //  0  for  a  non-conservative  process 
(i.e.,  scale-dependent).  H  is  given  by  //  =  ^^(1) ,  while  it 

can  be  seen  from  Eq.  (5)  that  ^^(1)  >1/3 .  C,  is  the  codi¬ 
mension  that  characterizes  the  intermittency  of  the  process 
and  satisfies  0  <  C,  <  is  the  dimension  of  the  obser¬ 
vation  space):  C,  =  0  for  a  homogeneous  process,  and  C, 

increases  with  the  intermittency  of  the  process,  indicating 
that  the  field  values  corresponding  to  any  given  level  of 
variability  are  more  scarce.  The  Levy  index  a  is  the  de¬ 
gree  of  multifractality,  bounded  between  a  =  0  and 
a  =  2 ,  corresponding  to  the  monofractal  case  and  to  the 
maximum,  or  log-normal,  multifractal  case,  respectively. 

In  other  words,  the  second  terms  of  Eqs.  (5)  and  (6) 
express  the  multiffactal  intermittent  deviation  from  mon- 
ofractality,  in  which  case  l^y{q)  =  ql2>  and  ^p(q)  =  qH  . 

The  “distance”  between  monofractality  and  multifractality 
is  then  a  function  of  C,  and  a .  The  knowledge  of  these 

parameters  is  enough  to  characterize  all  the  one-point  sta¬ 
tistics  of  the  fields.  In  particular,  we  see  from  Eqs.  (5)  and 
(6)  that  C,  is  given  by  C,j=l-3^j(3)  for  velocity  fluctua¬ 
tions,  and  C,j  =  //-^p(l)  for  a  passive  scalar  quantity  P. 
a  and  a  are  estimated  as  the  best  nonlinear  fit  of  Eqs.  (5) 
and  (6)  for  values  between  0  and  2,  using  a  simplex  proce¬ 
dure. 

Finally,  we  note  here  that  in  the  multiscaling  frame¬ 
work,  intermittency  is  taken  into  account  noting  that 

Py=\  +  CA2)  (7) 

Py=\  +  U2).  (8) 

We  thus  see  that  the  intermittency  corrections  introduced 
by  the  second  term  of  Eq.  (5)  lead  to  p,,  >  5/3  .  Further¬ 
more,  in  order  to  conduct  direct  quantitative  comparisons 
between  the  velocity  and  passive  scalar  intermittent  correc¬ 
tions  associated  to  C,  values,  let  us  note  that  the  coeffi¬ 
cients  in  front  of  the  non-linear  terms  in  Eqs.  (5)  and  (6) 
are  similar  only  if  we  compare  C^y  and  C,p ,  with  C,^ 
estimated  as  (Schmitt  et  al,  1996;  Seuront  et  al.,  1996a) 

C„=3-“'C„.  (9) 


Investigating  joint  intermittent  distributions: 
the  “Generalized  Correlation  Functions” 


Previous  standard  procedures  devoted  to  test  for  inde¬ 
pendence  between  two  given  processes  were  generally 
based  on  second  order  statistics  (i.e.,  covariance  and  corre¬ 
lation  functions),  even  when  they  were  conducted  in  scal¬ 
ing  framework  related  to  spectral  analysis  (Legendre  and 
Legendre,  1984)  or  geostatistical  analysis  (Kitanidis, 
1997).  More  recent  procedures  are  based  on  probability 
density  functions  examination  (Lueck  and  Walk,  1999). 
The  former  are  implicitly  based  on  Gaussian  hypothesis 
(uncorrelation  implying  independence),  untenable  in  an 
intermittent  framework  characterized  by  a  high-order  sta¬ 
tistical  behavior.  The  latter  do  not  deal  with  the  intrinsic 
multiscaling  properties  of  intermittent  fields.  We  then  pro¬ 
pose  here  an  original  testing  procedure  based  on  a  high- 
order  generalization  of  the  correlation  concept  between 
two  variables  X  and  Y . 

Instead  of  random  variables  X  and  Y ,  we  consider 
here  two  stochastic  processes  AA',  and  AY,  (e.g.,  Parzen, 

1962).  For  more  convenience,  let  us  note  x  =  AX,=e'^  and 
y=AY,  =  e°- .  The  joint  moments  can  be  written  as  the 
moments  of  a  vectorial  process: 

(;c  V)  =  oc  /^'e)  (10) 


where  the  vectors  Q  and  G  are  given  by  ^  =  {p,q)  and 
G  =  (G„G2),  and  fhe  exponents  S(Q)  characterize  the 
multiscaling  properties  of  the  joint  moments  .  The 

“Generalized  Correlation  Function”  (GCF  hereafter)  we 
introduce  is  then  simply  a  normalization  of  the  joint  mo¬ 
ments  following: 


c(p,q) 


«v) 

(«'>{>■'> 


(11) 


The  “Generalized  Correlation  Exponent”  (GCE  hereafter), 
estimated  as  the  slope  of  the  linear  trend  of  c(p^q)  vs.  / 
in  a  log-log  plot,  is  then  expressed  as 

^(P^^)  =  Cx(P)^^y(^yS(p,q)  (12) 

where  Cx(P)  ^ind  ^y(q)  characterize  the  multiscaling 
properties  of  the  single  fluctuations  and  as  de¬ 
fined  in  Eqs.  (3)  and  (4),  and  S(p,q)  characterizes  the 
multiscaling  properties  of  the  joint  fluctuations 
see  Eq.  (1 1).  Both  c{p,q)  and  r(p,q)  are  generalization  of 
correlation  functions.  They  express  the  correlation  between 
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and  and  their  scale  and  moment  dependence.  In  the 
particular  case  p-q-\^  Eq.  (11)  recovers  the  classical 
expression  of  the  correlation  coefficient  between  x  and 
We  nevertheless  need  to  emphasize  here  that,  whereas 

independence  implies  uncorrelation,  uncorrelation  does  not 
imply  independence. 

Indeed,  uncorrelation  corresponds  simply  to  the  relation 
r(l,l)  =  0.  Uncorrelation  implies  independence  only  in 

special  cases  such  as  for  Gaussian  processes.  To  show  this, 
let  us  consider  the  joint  scaling  function  for  lognormal 
multifractals  x  and  y.  Using  results  for  multivariate 

Gaussian  processes  (see  any  text  book  on  multivariate  sto¬ 
chastic  processes,  e.g.,  Samorodnitsky  and  Taqqu,  1994), 
one  has  the  general  expression  for  a  lognormal  process: 

q)  =  a,p-¥  a,q  -  a^q"  -a^p^-Gpq  (13) 

so  that 

r{p.q)^S{p,0)^S{Q,q)-Sip,q)  (14) 

giving 

r(p,q)  =  Opq.  (15) 

In  this  case,  it  is  clear  that  r(l,l)  or  r(2,2)  is  enough  to 

estimate  the  only  parameter,  namely  the  correlation  coeffi¬ 
cient  a,  so  that  if  r(l,l)  =  0  or  r(2,2)  =  0  it  can  be  con¬ 
cluded  that  the  two  processes  are  independent.  In  the  gen¬ 
eral  case,  this  is  no  more  true:  independence  between  the 
stochastic  processes  x  and  y  means  that  the  GCE  verifies 

''(P>^)  =  0  whatever  the  values  of  p  and  q  ,  while  uncor¬ 
relation  corresponds  to  r(l,l)  =  0 . 

A  function  related  to  our  c{p,q)  has  also  been  inde¬ 
pendently  introduced  in  the  lognormal  multifractal  frame¬ 
work  for  financial  time  series  by  Muzy  et  al  (2000),  where 
it  was  called  “joint  scaling  exponent”. 

The  function  c{p,q)  and  its  related  scaling  exponent 
KPiR)  can  be  used  as  an  analysis  tool  to  study  some  com¬ 
plex  relationship  between  two  multifractal  fields  x  and  y . 

To  provide  some  basis  for  discussion  and  interpretation  of 
experimental  results,  let  us  consider  some  limit  cases. 

If  X  and  y  are  independent,  as  was  said  above, 
r(/?,^)  =  0.  On  the  other  hand,  in  case  of  perfect  propor¬ 
tionality  X  =  K.y ,  where  A”  is  a  constant,  or  for  “random 
proportionality”  x  =  K.y ,  where  K  is  a  random  variable 
independent  on  y ,  it  is  readily  seen  that 

r{P,<])  =  ^r(P)  +  ^r(.^)-^YiP  +  ^)-  (16) 

In  particular,  one  may  note  that  r{p,q)>0  due  to  the  con¬ 
vexity  of  the  scaling  functions  ^(p) .  This  relation  can  be 


directly  tested  to  verify  the  random  proportionality  hy¬ 
pothesis.  Furthermore,  the  shape  of  the  surface  obtained  is 
symmetric  in  the  p-q  plane. 

Other  simple  situations  may  be  considered:  if  x  =  K.y^ 
with  b>0  and  K  constant,  or  if  x  =  x./  with  K  random 
and  independent  of  y ,  then  one  has 

rip,q)  =  CY  {bp )  +  Cy  (q)  -  ?y (^P  +  ^) .  (1 7) 

Eq.  (17)  is  still  positive,  but  no  more  symmetric  in  the  p-q 
plane;  it  is  symmetric  in  the  bp-q  plane.  In  this  framework, 
the  value  of  b  may  be  first  estimated  as  the  positive  value 
such  that 

r{pfi)^r{0M  (18) 

Using  the  value  estimated  this  way,  this  framework  is  then 
tested  by  verifying  that  r{plb,q)  is  indeed  symmetric  in 
the  p-q  plane.  More  generally  speaking,  the  more  r{p,q)  is 
positive,  the  more  the  x  =  AXf  and  y  =  are  dependent 
random  variables. 

The  main  advantages  of  this  framework  are  the  follow- 
ing:  (i)  it  makes  no  assumptions  about  the  spectrum  or  the 
distribution  of  either  data  sets;  (ii)  it  takes  fully  into  ac¬ 
count  their  intrinsic  multiscaling  properties;  (iii)  it  may  be 
used  to  provide  much  more  fundamental  relations  between 
two  pattems/processes;  and  iv)  more  generally  it  fully  ex¬ 
plores  qualitatively  and  quantitatively  the  correlations  of 
the  fields  x^  and  y*^ ,  revealing  the  correlations  of  large 
and  low  fluctuations  of  both  processes. 

Turbulent  velocity  fields 
Turbulence  measurements 

Laboratory  experiments 

Turbulence  has  been  generated  by  means  of  fixed  PVC 
grids  (diameter  2  mm,  mesh  size  1  cm)  in  a  circular  flume. 
Considering  the  dimensions  of  the  flume  and  the  flow  ve¬ 
locities  considered  in  the  experiment  (Table  1),  we  were 
dealing  with  a  fully  developed  turbulence.  A  high- 
frequency  (100  Hz)  time  series  of  instantaneous  horizontal 
turbulent  velocity  was  measured  by  hot-film  velocimetry 
10  and  20  cm  behind  the  grid.  The  average  turbulent  en¬ 
ergy  dissipation  rate  was  derived  following  the  isotropic 
formula  {Tennekes  and  Lumley,  1972): 

£  =  15v£'“'A^£(*)dA'  (19) 

where  8  is  the  turbulent  energy  dissipation  rate  (m^s’’), 
k  the  wavenumber  {k^lnlX,  m  '),  X  the  eddy  wave- 
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length  (m),  and  respectively  the  smallest  and 
largest  wavenumber  resolved,  and  E{k)  the  turbulence 
spectrum  (m  s"“).  The  spectrum  E{k)  is  regarded  as  the 

mean-square  amplitude  of  velocity  fluctuations  associated 
with  a  wavenumber  of  turbulent  motion. 

Table  1.  Main  characteristics  of  the  grid-generated  turbulence 
experiment  {d\  distance  from  the  grid;  Re:  Reynolds  number;  N: 
numbers  of  time  series  analyzed). 


d  (cm) 

Speed  (m  s'*) 

e  (m“  s'^) 

Re 

N 

20  cm 

0.15 

2.86  10'* 

45  10’ 

5 

0.20 

3.54  10-* 

60  10’ 

5 

0.25 

9.14  10-^ 

75  10’ 

5 

0.30 

1.36  10-’ 

105  10’ 

5 

0.35 

1.64  10'^ 

105  10’ 

5 

0.40 

1.82  10'^ 

120  10’ 

5 

10  cm 

0.15 

1.01  lo-’ 

45  10’ 

4 

0.20 

1.24  10-^ 

60  10’ 

5 

0.25 

3.2  10-® 

75  10’ 

5 

0.30 

4.75  10-* 

105  10’ 

5 

0.35 

5.75  10-^ 

105  10’ 

5 

0.40 

6.36  10-* 

120  10’ 

5 

To  investigate  potential  differences  in  the  structure  of 
velocity  fields  in  relation  to  the  different  levels  of  the  tur¬ 
bulent  kinetic  energy,  we  analyzed  here  5  time  series  of  1 
min  duration  (i.e.,  6000  data  points)  for  5  flow  velocities  at 
the  two  locations  considered  behind  the  grid. 

Field  experiments 

Measurements  of  the  turbulence  fields  were  done  using 
TurboMAP  (Turbulence  Ocean  Microstructure  Acquisition 
Profiler),  a  free-fall  integrated  system  housed  in  an  HCR 
treated  aluminum  pressure  case,  specifically  designed  for 
concurrent  measurements  of  turbulent  parameters  (shear, 
0t//3z,  where  u  and  z  are  the  turbulent  fluctuating  cross¬ 
stream  velocity  component  and  the  depth,  respectively), 
hydrographic  parameters  (pressure,  temperature  and  con¬ 
ductivity)  and  bio-optical  parameters  {in  vivo  fluorescence 
and  backscatter).  All  sensors  are  set  into  a  parabolically 
shaped  cap  at  the  top  of  the  pressure  case.  This  arrange¬ 
ment  ensures  that  these  sensors  point  into  an  undisturbed 
flow.  All  signals  are  simultaneously  sampled  at  256  Hz 
and  stored  internally  in  a  PCMCIA  memory  card.  More 
details  on  the  description  and  evaluation  of  both  design 
and  performance  of  this  instrument  can  be  found  in  Wolk 


etal  (2001  a,b). 

In  this  paper,  we  focus  only  on  the  ability  of  this  new 
high-resolution  biophysical  profiler  to  measure  the  shear 
signal,  dufdz .  In  that  way,  we  use  8  vertical  shear  profiles, 
recorded  on  21  August  1998  in  a  tidal  channel  located  be¬ 
tween  Neko-Seto  and  Meneko-Seto  Straits  (Japan).  The 
sampling  location  has  been  chosen  because  this  area  is 
characterized  by  very  strong  tidal  currents  (up  to  180  cm  s‘ 
),  associated  with  channel  narrowing,  and  a  subsequent 
high  tidal  mixing  {Takasugi  et  al.,  1994a,  b). 

The  turbulent  velocity  fluctuations  are  measured  with  a 
standard  piezo-ceramic  shear  probe,  which  has  extensively 
been  described  elsewhere  (e.g.,  Lueck  et  al,  1997).  The 
probe  is  associated  with  three  orthogonal  accelerometers 
mounted  inside  the  pressure  case  and  is  used  to  quantify 
noise  level  of  velocity  shear  and  to  clean  the  shear  signal 
of  possible  narrow-band  contaminations  from  vehicle  vi¬ 
brations  {Levine  and  Lueck,  1999). 

To  recover  the  turbulent  fluctuating  velocity  u  from 
du  I  dz ,  we  run  an  anti-derivative  digital  filter  over  the 
shear  data,  which  consists  of  a  scaled  single-pole  low-pass 
filter  with  a  cut-off  frequency  .  The  /^.  is  a  function  of 

the  integration  time  scale  as  =1/2.3/,  [see  Wolk  and 
Lueck  (2001)  for  further  details].  Here  we  used  /.  =7.5  s 
and  /.  =  0.06  Hz,  leading  to  a  maximum  wavelength  re¬ 
solved  to  10  m.  The  lowest  frequency  resolved  is  deter¬ 
mined  by  the  high-pass  filtering  of  the  shear  probe  signal 
to  minimize  pyro-electric  effects  (70  Hz). 

The  vertical  distribution  of  the  turbulent  kinetic  energy 
dissipation  rates  £  has  been  estimated,  after  systematic 
denoising  and  despiking,  from  shear  power  spectra  com¬ 
puted  for  512  successive  segments  of  shear  (i.e.,  every 
meter  considering  an  average  profiler  sinking  velocity  of 
0.5  m  s‘^)  following  the  isotropic  formula  defined  above  in 
Eq.  (19). 

Considering  the  extreme  fluctuations  observed  in  the 
vertical  distribution  of  the  dissipation  rates  £  (Fig.  1),  our 
analysis  has  been  conducted  on  all  the  512  data  point  seg¬ 
ments  available  within  each  vertical  profile  (Table  2). 
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Figure  1.  Example  of  the  intermittent  character  of  the  vertical 
distribution  of  the  kinetic  energy  dissipation  rate  £ ,  which  has 
been  estimated  every  meter  using  Eq.  (19). 


Table  2.  Depth,  numbers  of  512  data  points  subsections,  and 
mean  of  the  turbulent  kinetic  energy  dissipation  rate  e  (m‘  s‘^), 
for  each  of  the  eight  vertical  profiles. 


Data  set 

N 

n 

E 

1 

14463 

28 

o 

o 

oq 

2 

20479 

40 

1.87  lO-’ 

3 

21887 

43 

6.98  10* 

4 

18431 

36 

6.43  10-* 

5 

10495 

20 

9.61  10* 

6 

22911 

45 

1.33  10‘‘ 

7 

29949 

58 

3.72  10-’ 

8 

17279 

34 

5.65  10'* 

Characterizing  turbulent  velocity  fields 

The  velocity  spectra  exhibit  a  scaling  behavior  ranging 
between  0.1 85  ±0.1  m  and  0.013  +  0.015  m  for  velocity 
fluctuations  measured  in  our  circular  flume,  and  between 
1.010.1  m  and  0.05010.005  m  for  the  velocity  fluctua¬ 
tions  estimated  from  shear  data  recorded  in  the  field  (Fig. 
2),  These  scaling  behaviors  are  qualitatively  close  (we  try 
a  quantification  later)  to  the  theoretical  -*5/3  Kolmogorov’s 
power  law  over  roughly  1.1  and  1.3  decades,  respectively, 
with  Py  « 1.70  in  both  cases.  The  ranges  of  scales  defined 
that  way  will  be  regarded  as  the  inertial  subrange. 

The  estimates  of  the  velocity  structure  functions 
((AK/)^)  confirm  the  scaling  regimes  previously  shown  by 
spectral  analysis  for  different  orders  of  moment  q  (Fig.  3). 
Estimated  scale  ratios,  highest  and  lowest  wavenumbers  of 
scaling  ranges  observed  for  the  512  data  points  subsections 
considered  within  each  profile,  exhibit  very  similar  pat¬ 
terns  to  those  observed  in  the  spectral  framework.  We 


refer  the  reader  to  Table  1  for  further  details.  We  subse¬ 
quently  computed  the  scaling  exponents  f  {q)  for  several 

values  of  q  between  0  and  10  (i.e.  with  0.1  increments) 
for  the  range  of  scales  where  C^'(3)  =  l  is  valid;  this  range 
of  scales  characterizes  here  the  inertial  subrange.  The  scal¬ 
ing  of  the  second-order  moments  ^^(2)  confirms  the  esti¬ 
mates  of  the  spectral  exponents  Py  [Py  =  \  +  ^y(2) ,  Eq. 
(7)]:  Cf(2)  =  0.71010.010  for  grid-generated  turbulence 
and  ^y{2)  =  0.705 1 0.005  for  ocean  turbulence. 


Figure  2.  Samples  of  the  power  spectral  density  E(k)  of 
turbulent  velocity  fluctuations  recorded  with  a  hot-film 
velocimeter  (A)  located  behind  a  grid  in  a  circular  flume, 
and  (B)  in  a  tidally  mixed  tidal  channel,  shown  in  log-log 
plots  as  a  function  of  the  wavenumber  k  (m"’).  The  vertical 
dotted  lines  indicate  the  inertial  subrange  over  which  the 
spectra  roughly  follow  the  power-law  form  E{k)ock'^'  , 
with  Py-5/3  (continuous  lines). 
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Figure  3.  The  velocity  structure  functions  of  ocean 

velocity  versus  the  length  scale  /  in  a  log-log  plot  for  various 
values  of  the  statistical  order  of  moment  q  (i.e.,  ^=1,2  and  3, 

from  top  to  bottom).  Linear  trends  are  clearly  visible,  whatever 
the  order  of  moment  q  (continuous  lines),  and  indicate  the  mul¬ 
tiscaling  nature  of  our  datasets.  The  vertical  dotted  lines  corre¬ 
spond  to  the  upper  and  lower  limits  of  the  inertial  subrange.  Note 
that  the  different  scaling  ranges  (estimated  from  the  profile 
shown  in  Fig.  2a)  are  indistinguishable  from  the  one  estimated 
from  spectral  analysis. 

More  generally,  the  clear  nonlinearity  of  the  functions 
(Fig.  4)  is  direct  evidence  for  the  multiffactal  nature 
of  the  velocity  fluctuations.  Moreover,  within  experimental 
error,  the  functions  l^y^q)  for  grid-generated  and  ocean 
turbulent  velocity  fluctuations  cannot  be  qualitatively 
shown  as  being  different.  While  the  similarity  observed  in 
the  shape  of  the  functions  t^y{q)  qualitatively  suggests 
some  convergent  behavior  in  the  high-order  structure  of 
velocity  fluctuations,  the  values  of  the  universal  multiffac¬ 
tal  parameters  C|j  and  lead  to  specifying  these  re¬ 
sults.  These  parameters  have  been  estimated  as 
Cij:  =0.18  +  0.02  ,  and  =  1 .52  ±  0.03  for  grid-generated 
turbulence,  and  C,,  =0.1 6 ±0.02  and  a,  =1.55 ±0.04  for 
ocean  turbulence.  That  leads  to  confirmation  that  the  struc¬ 
ture  of  velocity  fluctuations  recorded  in  the  field  and  in  the 
laboratory  cannot  be  regarded  as  being  significantly  differ¬ 
ent. 

On  the  other  hand,  the  parameters  C,^ ,  estimated  using 
C^,  and  in  Eq.  (9)  as  C,^  =0.032  ±0.02 ,  appear  slightly 
smaller  than  the  values  reported  in  the  literature  for  the 
parameters  C„,  characterizing  the  intermittent  behavior  of 

temperature  and  salinity  in  highly  dissipative  areas: 


Figure  4.  The  scaling  exponents  structure  function  ^y{q)  em¬ 
pirical  curves  (open  dots)  estimated  from  grid-generated  turbu¬ 
lent  velocity  fluctuations,  compared  to  the  monofractal  curve 
C('(9)  =  9/3  (dashed  line),  and  to  the  universal  multifractal 
curve  (thick  continuous  curve)  obtained  with  C,j=0.18  and 
=1.52  inEq.  (5). 

Cjy.  6  [0.035  — 0.060]  and  C,^  €  [0.035  — 0.062]  (Seuront  et  al., 

1996a,b,  1999,  2001a).  This  implies  that  the  temperature 
and  salinity  fields  are  much  more  intermittent  than  the 
velocity  field;  this  high  intermittency  can  be  related  to  the 
strong  discontinuities  (“ramps”)  observed  in  the  tempera¬ 
ture  signals  {Sreenivasan,  1991;  Pumir,  1994b).  Intermit¬ 
tency  for  passive  scalars,  in  both  two  and  three  dimen¬ 
sions,  has  also  been  studied  in  direct  numerical  simulations 
{Holzer  and  Siggia,  1994;  Pumir,  1994a,b).  Such  simula¬ 
tions  often  reveal  extended  regions  of  almost  uniform  tem¬ 
perature  separated  by  boundary  layers  (ramps)  with  strong 
gradients. 

We  should  finally  note  that  the  multifractal  parameters 
C|£  and  obtained  here  for  the  very  first  time  for  oce¬ 
anic  turbulence  are  very  similar  to  the  parameters  esti¬ 
mated  for  atmospheric  turbulence:  C,^  =0.15  ±0.03  and 
a,  =1.50  ±0.05  (Fig.  5;  Schertzer  et  al,  1995;  Schmitt  et 
al,  1992a,  b,  1993,  1996;  Chigirinskaya  et  al,  1994).  As 
previously  noticed  by  Seuront  et  al  (2001b),  these  results 
suggest  a  strong  similarity  of  the  intermittent  nature  of 
turbulent  velocity  fluctuations  in  the  atmosphere  and  the 
ocean.  The  resolution  of  this  particular  issue,  well  beyond 
the  scope  of  the  present  paper,  is  in  progress.  In  particular, 
the  question  is  whether  atmospheric  and  oceanic  tempera¬ 
ture  fields  behave  in  a  similar  fashion  {Seuront  and  Yama- 
zaki,  200 1 ;  Seuront  et  al ,  200 1  c). 
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Figure  5.  Comparison  of  the  empirical  values  of  the  functions 
i^y{q)  estimated  from  grid  generated  turbulence  (open  dots)  and 
field  ocean  turbulence  (open  triangles),  with  the  universal  multi¬ 
fractal  fit  obtained  with  and  values  characteristic  of  at¬ 
mospheric  turbulence,  i.e.  C,^=0.15  and  =1.50  in  Eq.  (5). 
The  linear  curve  expected  in  case  of  non- 

intermittent  turbulence  is  shown  for  comparison  (dashed  line). 

Ocean  passive  and  ‘‘biologically  active” 
scalars 

Phytoplankton  of  the  Eastern  English  Channel 

In  a  recent  paper,  Seuront  et  al  (1999)  showed  that  the 
small-scale  intermittent  distribution  of  temperature  (a  sca¬ 
lar  quantity  passively  advected  by  turbulent  fluid  motions) 
and  phytoplankton  biomass  recorded  in  the  coastal  waters 
of  the  Eastern  English  Channel  were  indistinguishable, 
both  in  spectral  and  multi  fractal  framework  (Fig.  6).  They 
subsequently  concluded  their  study  stating  that  phyto¬ 
plankton  biomass  could  be  regarded  as  a  purely  passive 
scalar  at  small  scales.  Here,  small  scales  correspond  to 
temporal  scales  bounded  between  1  and  20  seconds.  Using 
the  Taylor’s  theory  of  frozen  turbulence,  these  temporal 
scales  can  be  related  to  spatial  scales  bounded  between  0. 1 
and  22  meters.  For  larger  scales,  the  phytoplankton  bio¬ 
mass  exhibits  a  very  specific  heterogeneous  distribution, 
dominated  by  biological  processes  resulting  from  complex 
interactions  between  the  turbulence  level  of  fluid  motions 
(mainly  different  tidal  conditions  related  to  both  ebb/flood 
and  neap/spring  tidal  cycles),  the  phytoplankton  biomass 
concentration,  and  the  specific  composition  of  phytoplank¬ 
ton  assemblages  {Seuront  et  al,  1996b;  Seuront  et  al, 
1999).  These  larger  scales  are  not  investigated  in  the  pre¬ 
sent  work. 


Table  3.  Mean  values  of  the  empirical  estimates  of  the  spectral 
exponent  pj, ,  and  the  universal  multifractal  parameters  H ^ , 

Cyp  and  for  subsets  of  in  vivo  fluorescence  time  series  re¬ 
corded  under  different  levels  of  turbulence. 


e  (m^s'^) 

H 

c, 

a 

I  10-^ 

1.76 

0.43 

0.060 

1,80 

5  10-* 

1.76 

0.43 

0.055 

1.82 

1  10- 

1.77 

0.42 

0.050 

1.80 

5  10'^ 

1.78 

0.42 

0.045 

1.81 

1  10"’ 

1.78 

0.4! 

0.040 

1.80 

Turbulence  and  phytoplankton  patchiness 

Considering  the  role  played  by  turbulence  in  the  forma¬ 
tion  and  disruption  of  particle  aggregates  [e.g.,  Kiorboe 
(1997)  and  references  therein],  and  the  highly  varying  dis¬ 
sipation  rates  of  turbulent  energy  observed  over  a  tidal 
cycle  in  this  area  (between  10‘^  and  10"^  m“  s'^;  Seuront  et 
al,  2001a),  the  question  here  is  whether  the  local  hydro- 
dynamic  conditions  have  an  effect  on  both  temperature  and 
phytoplankton  biomass  local  distributions. 

The  original  46  h  24  min  time  series  recorded  at  1  Hz 
(i.e.,  167040  data;  Seuront  et  al,  1999)  has  then  been  di¬ 
vided  into  46  subseries  of  1  h  (i.e.,  3600  data).  We  then 
built  5  sets  of  7  subseries,  each  set  corresponding  to  a  dif¬ 
ferent  level  of  dissipation  rates  ranging  from  £  =  10'®  m‘.s'^ 
to  £  =  10^  m^.s  We  then  conducted  both  spectral  and 
multiffactal  analyses  on  each  subseries  of  temperature  and 
in  vivo  fluorescence  within  a  given  subset,  and  the  result¬ 
ing  observations  led  to  different  comments  (Table  3). 


Figure  6.  Comparison  of  the  functions  (open  dots) 
and  l^fr{q)  (black  dots)  estimated  from  small  scale  (<20 

seconds)  temperature  and  in  vivo  fluorescence  fluctuations; 
(adapted  from  Seuront  et  al,  1999). 
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The  slopes  and  ,  estimated  for  the  frequency 
bands  that  maximized  the  coefficient  of  determination  and 
minimized  the  total  sum  of  squared-residuals  for  the  re¬ 
gression  in  log-log  plots  of  the  temperature  and  fluores¬ 
cence  power  spectra  [cf  Eq.  (2)],  cannot  be  statistically 
distinguished  within  each  set  of  temperature  and  fluores¬ 
cence  subseries  (analyses  of  covariance,  p>0.05),  nor 
between  the  5  sets  considered  (p>0.05).  These  results, 
fully  congruent  with  previous  studies  conducted  in  the 
same  area  (Seuront  et  aL,  1996a,  b,  1999,  2001a),  suggest 
extreme  similarity  between  the  single  scaling  properties  of 
small-scale  temperature  and  fluorescence  distributions.  At 
this  stage,  phytoplankton  biomass  can  then  still  be  re¬ 
garded  as  being  a  passive  scalar. 

The  nonlinear  behavior  of  the  functions  ^(q)  confirms 
and  generalizes  in  different  hydrodynamic  regimes  the 
multifractal  nature  of  temperature  and  in  vivo  fluorescence 
observed  by  Seuront  et  al  (1999)  using  a  global  approach 
of  the  same  data  set  (Fig.  6).  However,  the  functions  Cr(^) 

obtained  for  temperature  time  series  remain  similar  what¬ 
ever  the  external  hydrodynamic  forcing  (i.e.,  the  values  of 
the  dissipation  rate  8 ),  while  the  nonlinearity  (i.e.,  con¬ 
vexity)  of  the  functions  (^)  characterizing  phytoplank¬ 
ton  biomass  clearly  increases  when  the  value  of  the  dissi¬ 
pation  rate  8  decreases  (Fig.  7). 


0  12  3  4  5 


Figure  7.  Comparison  of  the  empirical  values  of  the  functions 
l^(q)  estimated  for  temperature  (continuous  curve)  and  in  vivo 
fluorescence  for  different  values  of  the  turbulent  kinetic  energy 
dissipation  rate  e  (£  =  10“^  m"  s'\  rhombs;  8  =  5x10^  m^  s"\ 
circles;  8  =  10'^  m“s■^  squares;  8  =  5x10"^  m~  s'\  crosses; 
8  =  10"^  m■s’^  triangles).  The  linear  curve  (j.(q)^qH  ex¬ 
pected  in  case  of  non-intermittent  turbulence  in  shown  for  com¬ 
parison  (dashed  line). 


The  distribution  of  temperature  then  remains  the  same 
whatever  the  hydrodynamic  conditions.  This  result  is  con¬ 
vergent  with  our  previous  observations  regarding  the  dis¬ 
tribution  of  turbulent  velocity  fields,  and  it  confirms  the 
passive  scalar  behavior  of  temperature. 

On  the  other  hand,  the  distribution  of  phytoplankton 
biomass  is  less  intermittent  and  is  closer  to  that  of  tem¬ 
perature  in  high  turbulent  conditions  (Fig.  7).  This  sug¬ 
gests  that  phytoplankton  cells  behave  more  like  a  passive 
scalar  under  high  hydrodynamic  conditions  but  exhibit 
very  specific  properties  under  lower  levels  of  turbulence. 
The  universal  multifractal  parameters  // ,  Cj ,  and  a 

confirm  these  qualitative  observations  (Table  3).  Let  us 
remember  here  that  the  parameters  H ,  C^,  and  a  have 

been  obtained  by  fitting  the  universal  multifractal  model 
detailed  in  Eq.  (6)  to  the  empirical  function  as  de¬ 
scribed  above.  These  parameters  remain  constant  for  tem¬ 
perature  time  series  with  //y.=0.40,  Ci;-  =  0.05  and 

=  1 .90 .  On  the  other  hand,  it  is  shown  that  only  H  and 
Cj  remain  roughly  constant  for  fluorescence,  with 
=0.42  ±0,01  and  1.81  ±0.01  [values  compatible 
with  the  Hj,  =  0.43  and  =  1 .80  found  by  Seuront  et  al 
(1999)],  However,  the  parameter  Q;.  is  highly  variable.  It 
increases  33%  when  the  dissipation  rate  8  decreases  from 
the  highest  to  the  lowest  values  experienced  over  a  tidal 
cycle,  i.e.  from  8  =  10"^  to  8  =  10“^  m“.s■^  Under  low  hy¬ 
drodynamic  conditions  the  higher  values  of  indicate 

the  occurrence  of  few  patches  of  high  phytoplankton  con¬ 
centrations  that  are  several  orders  of  magnitude  above 
background  levels.  Under  higher  hydrodynamic  conditions 
the  lower  values  of  C^p  indicate  that  phytoplankton  cells 

are  more  homogeneously  distributed  and  behave  as  a  pas¬ 
sive  scalar.  Here  the  so-called  homogenization  effect  is 
thought  to  be  associated  with  the  disruption  of  these 
phytoplankton  patches.  This  hypothesis  is  supported  by  the 
C^P  values  observed  for  fluorescence  time  series  under 

high  hydrodynamic  conditions,  which  cannot  be  distin¬ 
guished  from  the  Cyp  values  estimated  for  temperature 

irrespective  of  hydrodynamic  conditions,  and  by  the  ex¬ 
treme  similarity  of  the  empirical  functions  f^(^)  and 
l^p{q)  in  high  hydrodynamic  conditions  (Fig.  7). 

Turbulence  and  biophysical  couplings 

In  order  to  investigate  the  nature  of  the  dependence  be¬ 
tween  temperature  and  phytoplankton  distributions,  we 
subsequently  computed  the  Generalized  Correlation  Func¬ 
tions,  c{p,q)  and  the  related  Generalized  Correlation  Ex¬ 
ponents,  r(/?,^),  between  temperature  and  fluorescence 
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time  series,  within  each  subset  defined  above. 


Figure  8.  The  Generalized  Correlation  Functions  (GCF)  c{p,q) 
versus  the  time  scale  /  in  log-log  plots,  for  (A)  simultaneously 
and  (B)  independently  sampled  temperature  and  in  vivo  fluores¬ 
cence  time  series.  The  function  c{p,q)  shown  here  has  been 
estimated  for  a  constant  value  of  the  statistical  order  of  moment 
q  of  temperature  fluctuations  {q  =  2\  and  various  values  of  the 
statistical  order  of  moment  p  of  in  vivo  fluorescence  (i.e.  p  = 

1,  2  and  3,  from  bottom  to  top).  The  slopes  of  the  linear  regres¬ 
sion  estimated  over  the  scaling  ranges  (dashed  lines)  provide 
estimates  of  the  Generalized  Correlation  Exponents  (GCE) 


Figure  9.  The  Generalized  Correlation  Exponents  (GCE 
r{p,q) ,  shown  as  a  function  of  both  p  and  q  ,  which  charac¬ 
terize  in  vivo  fluorescence  and  temperature  fluctuations,  respec¬ 
tively.  The  function  r{p,q)  is  estimated  here  between  time  series 
of  temperature  and  fluorescence  independently  sampled  (C). 

Figure  8  shows  the  GCF,  c{p,q) ,  plotted  in  log-log 
plots  versus  the  time  scale  t ,  for  simultaneously  recorded 
temperature  and  fluorescence  time  series  (Fig,  8a),  as  well 
as  for  temperature  and  fluorescence  time  series  taken  at 
different  moments  of  the  tidal  cycle,  and  a  fortiori  inde¬ 
pendent  (Fig.  8b).  Both  the  linear  behavior  of  the  functions 
c{p,q)  over  scales  ranging  between  1  and  20  seconds,  and 
the  positive  values  taken  by  the  GCE,  r{p,q) ,  estimated  as 
the  slope  of  the  linear  trend  of  the  GCF  c(p,q)  vs.  f  in  a 
log-log  plot,  whatever  the  combinations  of  p  and  q  val¬ 
ues  (Fig.  8a)  confirm  the  validity  of  Eq.  (1 1),  On  the  other 
hand,  the  weak  values  taken  by  the  functions  c{p,q)  esti¬ 
mated  between  independent  temperature  and  fluorescence 
time  series  (Fig.  8b)  indicate  a  low  correlation  between  the 
temperature  and  phytoplankton  biomass  fluctuations, 

^  and  ^(AF/  Y  ^  .  This  is  confirmed  by  the  related 

values  of  the  functions  r{p,q) ,  which  remain  close  to  zero, 
whatever  the  combinations  of  p  and  q  values  (Fig.  9). 

We  subsequently  refined  these  observations,  comparing 
the  functions  r{p,q)  obtained  between  temperature  and 
fluorescence  time  series  in  different  turbulent  conditions. 
Figure  10  then  shows  the  r{p,q)  obtained  for  all  the  com¬ 
binations  of  p  and  q  values  with  0.1  increments,  for  three 
levels  of  turbulence  ( 10“%  10“^  and  10“®  m^  s‘*^).  It  appears 
that  the  correlation  between  temperature  and  in  vivo  fluo¬ 
rescence  fluctuations  increases  with  hydrodynamic  condi¬ 
tions  and  is  weaker,  even  nil,  in  low  turbulent  conditions. 

We  confirm  here  the  increased  physical  control  sug¬ 
gested  under  strong  turbulent  conditions  from  the  analysis 
of  the  shape  of  the  function  its  comparison 
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with  the  function  ;  see  Figure  7.  On  the  other  hand, 
the  decorrelation  between  temperature  and  phytoplankton 
fluctuations  under  weak  turbulent  conditions  suggests  an 
increase  in  the  biological  contributions  to  the  control  of 
phytoplankton  biomass  distribution,  and  confirms  previous 
observations  (cf  Fig.  7).  Phytoplankton  distribution  then 
appears  independent  of  the  temperature  distribution  under 
the  lowest  turbulence  levels  investigated  here,  i.e.,  10“^  m" 
s'^  (Fig.  10a). 

More  precisely,  the  shape  of  the  function  r{p,q)  indi¬ 
cates  that  large  phytoplankton  fluctuations  are  associ¬ 
ated — under  strong  enough  turbulent  conditions — ^to  weak 
temperature  gradients,  and  conversely.  This  tendency 
seems  to  reflect,  over  a  slightly  wider  range  of  scales,  find¬ 
ings  of  Desiderio  et  al  (1993);  they  observed  the  occur¬ 
rence  of  0.1 -0.2  meter  thick  fluorescent  layers  just  above 
local  temperature  gradients.  While  these  qualitative  com¬ 
ments  suggest  a  proportionality  relationship  between  tem¬ 
perature  and  phytoplankton  fluctuations,  the  relation  (16) 
has  never  been  verified.  The  phytoplankton  fluctuations 
then  cannot  be  regarded  as  strictly  proportional  or  ran¬ 
domly  proportional  to  the  fluctuations  of  temperature,  i.e., 
AF  =  KAT  and  AF  =  kAT  ,  respectively.  In  contrast,  the 
relation  (17)  has  been  verified  testing  the  validity  of  Eq. 
(18)  over  a  wide  range  of  b  values.  Using  b  values  ranging 
between  0.05  and  5  (with  0.05  increments),  we  then 
showed  that  Eq.  (18)  is  verified  for  three  of  the  five  levels 
of  turbulence  investigated  here:  e  =  10"^  m^  s’^  with 

b  =  2.69,  £  =  5.10"^  m^  s’^  with  Z>  =  2.25,and  £  =  10“''  m^ 
s'^  with  6  =  1.76.  Figure  11  shows  the  relation  /*(p,0)  vs. 
r{0,bp)  corresponding  to  the  functions  r{p,q)  shown  in 
Fig.  10b  and  c,  with  6  =  2.69  and  6  =  1.76,  respectively. 
The  correlation  shown  between  temperature  and  phyto¬ 
plankton  fluctuations  under  high  turbulent  conditions,  is 
then  related  to  a  power-law  dependence  relationship,  i.e. 
AF  =  K,{ATf . 

Let  us  mention  here  that  at  this  stage  of  the  development 
of  the  multi  scaling  techniques  illustrated  in  the  present 
paper,  the  presence  and  the  absence  of  evidence  for  power- 
law  dependence  between  phytoplankton  and  temperature 
fluctuations  cannot  (a  priori)  be  regarded  as  a  general  rule 
driving  biophysical  couplings  in  pelagic  ecosystems.  In¬ 
deed,  the  relationship  between  temperature  and  in  vivo 
fluorescence  or,  more  generally,  biophysical  couplings, 
can  be  influenced,  and  then  complexified,  by  both  the 
processes  related  to  phytoplankton  cells  ecology  and  to  the 
sampling  strategy.  Over  the  range  of  scales  investigated  in 
the  present  study,  in  vivo  fluorescence  fluctuations  can 
indeed  result  from  nonlinear  interactions  between  the 
physiological  state  of  phytoplankton  cells,  their  related 
coagulation  ability,  the  specific  composition  of  phyto¬ 
plankton  assemblages,  the  grazing  impact  of  copepods 
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Figure  10.  The  Generalized  Con*elation  Exponents  (GCE 
r(p,q) ,  shown  as  a  function  of  both  p  and  q  .  Here  the  statisti¬ 
cal  orders  of  moment  p  and  q  characterize  temperature  and  in 
vivo  fluorescence  fluctuations,  respectively.  The  functions 
r{p,q)  correspond  to  three  different  levels  of  turbulence,  i.e.  (A) 

£  =  10"'  (B),  10“',and(C)  10"'  m^s•^ 


(shown  recently  to  influence  both  spectral  and  multifractal 
properties  of  phytoplankton  biomass;  SeuronL  1999; 
Lovejoy  et  al,  2001),  and  the  turbulent  processes.  The  re¬ 
sulting  fluctuations  can  then  be  potentially  more  complex 
than  those  observed  for  a  purely  passive  scalar  such  as 
temperature. 
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Figure  11.  Plots  of  the  Generalized  Correlation  Exponents 
r(/7,0)  versus  rifi^bp)  for  two  levels  of  turbulence,  e  =  10^ 

with  b  =  2.69  (A)  and  e  =  10"-  with  b-\J(>  (B).  The  slopes 
of  the  regression  lines  (continuous  line)  cannot  be  distinguished 
from  that  of  the  relation  bp^q  (dashed  line).  This  shows  the 
validity  of  Eq.  (18)  and  the  symmetry  of  the  functions  r(p,q)  in 
the  bp  —  q  plane. 

On  the  other  hand,  considering  (i)  the  highly  intrusive 
character  of  the  sampling  device  used  to  record  tempera¬ 
ture  and  in  vivo  fluorescence  (a  Sea-Bird  25  Sealogger 
CTD  probe  and  a  Sea  Tech  fluorometer,  respectively),  and 
(ii)  the  spatial  separation  of  the  temperature  and  fluores¬ 
cence  sensors  («10  cm),  it  is  reasonable  to  think  that  the 
most  extreme  events,  which  are  also  the  most  infrequent 
and  contribute  to  the  highest  values  of  the  moments  p  and 


q  ,  are  not  necessarily  simultaneously  sampled  in  a  proper 
way,  leading  to  spurious  joint  high-order  properties.  In 
particular,  this  could  explain  the  weakening  of  the  Gener¬ 
alized  Correlation  Exponents,  r{p,q)^  for  values  of  the 
moments  p  and  q  such  as  /? > 4  and  q>A .  The  latter 
limitation  will  be  easily  fixed  in  the  near  future  with  the 
increasing  development  and  availability  of  microstructure 
profilers  able  to  record  both  physical  and  biological  pa¬ 
rameters  with  a  high-spatial  resolution  from  mildly  intru¬ 
sive  sensors  {Seuront  et  ai,  200 Id, e).  The  former  limita¬ 
tion  will  be  difficult  to  overcome,  because  of  the  intrinsic 
physiological,  biological,  and  ecological  processes  affect¬ 
ing  phytoplankton  populations. 

Conclusions:  the  cost  and  gain  of  increasing 
analysis  complexity 

Our  results  demonstrate  that  single  scaling  properties  of 
temperature  and  phytoplankton  would  have  erroneously 
led  to  conclude  that  the  latter  could  be  regarded  as  a  purely 
passive  scalar.  However,  investigations  of  their  detailed 
variability  in  the  universal  multifractal  framework  lead  to 
consideration  that  phytoplankton  biomass  presents  very 
specific  properties,  depending  on  the  intensity  of  local 
turbulent  processes.  This  suggests  a  differential  control  of 
phytoplankton  biomass  distribution  involving  complex 
interactions  between  turbulent  velocity  fields  and  the  in¬ 
volved  particles  properties  such  as  buoyancy,  size,  density 
or  aggregative  properties.  In  particular,  this  assertion  is 
supported  by  the  specific  behavior  of  phytoplankton  mul¬ 
tiscaling  properties  under  low  turbulent  conditions,  sug¬ 
gesting  the  prevalence  of  phytoplankton  cells  specific 
properties  on  turbulent  processes. 

This  has  been  confirmed  and  specified  by  the  introduc¬ 
tion  of  the  Generalized  Correlation  Functions/Exponents. 
The  correlation — and  then  the  couplings — between  tem¬ 
perature  and  phytoplankton  biomass  fluctuations  are  thus 
higher  under  high  hydrodynamic  forcing  and  decrease  pro¬ 
gressively  to  collapse  under  low  hydrodynamic  forcing.  In 
particular,  careful  examination  of  the  functions  r(p,q) 
indicates  that  large  phytoplankton  fluctuations  are  associ¬ 
ated — under  strong  enough  turbulent  conditions — with 
weak  temperature  gradients,  and  conversely.  These  results 
are  fully  congruent  with  more  intuitive  and  qualitative  re¬ 
sults  by  Desiderio  et  al.  (1993)  over  similar  scales  and 
Wolk  et  al  (2001)  at  smaller  scales.  Whatever  that  may  be, 
even  if  additional  work  and  data  sets  will  be  needed  to 
achieve  more  definitive  functional  scenarios,  the  joint  mul¬ 
tifractal  framework  introduced  here  provides  the  first  ob¬ 
jective  method  to  characterize,  both  qualitatively  and 
quantitatively,  the  details  of  the  relationships  between  any 
pair  of  simultaneously  recorded  patterns  or  processes.  In 
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particular,  this  framework  makes  no  assumptions  about  the 
spectrum  or  the  distribution  of  either  data  sets,  and  it  takes 
fully  into  account  their  intrinsic  multiscaling  properties. 
We  should  also  note  here  that  this  work  provides  the  very 
first  evidence  of  the  multifractal  character  of  ocean  turbu¬ 
lent  velocity  fluctuations,  and  reveals  their  extreme  simi¬ 
larity  in  the  ocean  and  the  atmosphere. 

Finally,  regarding  the  additional  quantity  of  information 
brought  by  the  univariate  and  bivariate  multiscaling  proce¬ 
dures  described  here  to  our  understanding  of  the  structure 
of  velocity,  temperature  and  phytoplankton  fluctuations,  as 
well  as  of  the  couplings  between  temperature  and  phyto¬ 
plankton  biomass,  we  claimed,  as  already  stressed  else¬ 
where  (e.g.,  Yamazaki,  1993;  Seuront,  2001 ;  Seuront  et  al, 
2001c),  that  taking  into  account  the  reality  of  the  intermit¬ 
tent  nature  of  bio-physical  microstructures  could  be  the 
first  step  towards  a  general  understanding  of  structures  and 
functions  in  marine  pelagic  ecosystems.  The  quite  elevated 
cost  associated  with  making  use  of  the  single  and  joint 
multifractal  tools  presented  here  in  terms  of  algorithmic 
developments  and  computation  time  consumption  will  then 
be  easily  overcome  by  the  priceless  gain  in  emerging 
knowledge. 
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Abstract.  Our  turbulent  lengthscale,  velocity  and  diffusivity  scalings  are 
compared  with  data  from  other  numerical,  laboratory  and  field  experiments, 
Comparison  is  made  with  reference  to  the  turbulence  intensity  measme 
e/uN"^.  We  showed  that  our  turbulent  lengthscale  and  velocity  results  are 
consistent  with  measurements  from  the  experiments  considered,  and  that 
the  rms  turbulent  lengthscale  Lt  is  independent  of  the  rate  of  dissipation 
of  turbulent  kinetic  energy  when  e/ul^  >  300.  A  diffusivity  modelled  in 
terms  of  an  advective  buoyancy  flux,  &,  is  found  to  reproduce  our  direct 
measiuements  of  Kp  in  the  experiments  considered  when  e/uISP  >  300  .  It 
is  shown  that  modelling  Kp  as  0.2e/iV^  is  a  poor  parameterisation  of  the 
advective  buoyancy  flux  model  in  all  the  experiments  considered,  and  that 
at  large  e/i/A^,  this  parameterisation  can  over-predict  the  true  Kp  by  two 
orders  of  magnitude.  This  overprediction  is  discussed  in  terms  of  a  miviTig 
efficiency  and  it  is  shown  that  in  the  experiments  considered  the  mixing 
efficiency  decreases  rapidly  with  increasing  Finally,  the  application 

of  ovu  diffusivity  scaling  to  other  geophysical  flows  is  discussed,  and  it  is 
shown  that  a  necessary  requirement  for  the  use  of  this  scaling  is  that  Lt  is 
independent  of  the  rate  of  dissipation  of  turbulent  kinetic  energy. 


Introduction 

A  clear  understanding  of  the  irreversible  vertical 
transport  of  mass  in  a  stably  stratified  turbulent  fiow 
is  fundamental  to  quantifying  the  dynamics  of  density 
stratified  fluids.  The  rate  at  which  this  transport  occurs 
has  been  been  widely  modelled  as  a  turbulent  diffusiv- 
ity  for  mass,  Kp.  Since  Kp  influences  the  distribution 
of  heat,  mass,  contaminants  and  biota  throughout  a 
turbulent  fluid  (e.g.  Tennekes  and  Lumley,  1972),  an 
understanding  of  this  quantity  is  essential  to  the  man¬ 
agement  of  aquatic  systems  such  as  lakes,  estuaries  and 
the  oceans. 

The  simplest  model  describing  turbulent  transport 
in  homogeneous  isotropic  turbulence  relates  a  turbu¬ 
lent  diffusivity  AT  to  a  turbulent  velocity  scale  U  and  a 
turbulent  integral  lengthscale  L  (Tennekes  and  Lumley, 
1972), 

K^UxL.  (1) 


is 


where  u  is  the  fluctuating  component  of  velocity  and 
I  is  a  linear  dimension  defining  the  scale  of  the  turbu¬ 
lent  field.  If  we  assume  that  {7,  u  and  L,  I  are  well  rep¬ 
resented  by  their  root  mean  square  (ms)  velocity  and 
length  scales  Ut  and  Lt  respectively,  (1)  and  (2)  can  be 
combined  to  give 

(3) 

The  rate  at  which  a  stratifying  scalar  B  is  irreversibly 
mixed  in  a  turbulent  flow  has  generally  been  modelled 
using  an  average  vertical  advective  flux  fl'ty',  where  & 
and  ly'  are  the  instantaneous  scalar  and  vertical  velocity 
fluctuations  respectively  (see  Gregg,  1987,  for  a  full  re¬ 
view).  This  advective  flux  has  been  formulated  in  terms 
of  an  eddy  coefficient  (or  diffusivity)  Kq  as 


dz 


This  expression  for  K  is  considered  to  represent  a  bulk 
eddy  diffusivity.  Taylor  (1935)  found  that  the  rate  of 
dissipation  of  turbulent  kinetic  energy  per  unit  mass,  e. 
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where  80/ dz  is  the  mean  vertical  scalar  gradient  on 
which  the  advective  flux  acts.  Osbom  (1980)  used 
(4)  to  model  Kp  in  the  shear  driven  environment  of 
the  ocean  thermocline.  By  using  the  turbulent  ki¬ 
netic  energy  equation,  0 shorn  (1980)  suggested  that 
Kp  <  0.2e/iV’^,  where  the  multiplicative  constant  is 
equal  to  Rf /{I  —  i?/),  and  Rj  is  a  mixing  efficiency 
taken  to  be  less  than  or  equal  to  0.15.  This  model 
for  Kp  has  often  been  used  in  geophysical  applications 
other  than  originally  intended,  and  with  the  inequality 
written  as  an  equality  such  that 

^p  =  0-24-  (5) 


Winters  and  D’Asaro  (1996)  argued  that  rather  than 
modelling  Kp  with  (4),  it  can  be  computed  directly  as 


Kp  = 


<t>d 

As.  ’ 

dz^ 


(6) 


where  is  the  irreversible  diffusive  flux  of  density 
across  an  isoscalar  surface  in  a  turbulent  flow  and 
dp/dz^  is  the  resorted  density  gradient.  Using  this  defi¬ 
nition,  Barry  et  al  (2001)  conducted  controlled  labora¬ 
tory  experiments  that  directly  and  independently  mea¬ 
sured  Kp^  the  rate  of  dissipation  of  turbulent  kinetic 
energy  e,  a  turbulent  lengthscale  Lt  and  the  buoyancy 
frequency  =  —{g/pQ){dp/dz)  in  a  closed,  shear  free, 
linearly  salt  stratified  fluid.  Turbulence  was  generated 
in  the  entire  fluid  volume  by  the  steady  motion  of  a 
horizontally  oscillating  vertical  rigid  grid  comprising  1- 
cm  bars  at  a  5-cm  spacing.  A  schematic  diagram  of 
the  experimental  configuration  is  shown  in  Figure  1, 
and  full  details  are  given  in  Barry  et  al  (2001).  Barry 
et  al  (2001)  described  the  behaviour  of  Kp  with  ref¬ 
erence  to  the  turbulence  intensity  parameter  e/vN’^, 
where  v  is  the  kinematic  viscosity  of  the  fluid.  Two 
regimes  were  identified;  regime  E  (energetic  turbu¬ 
lence,  >  300)  and  regime  W  (weak  turbulence, 

e/i/N^  <  300).  It  was  found  that 


=  (7) 

=  (8) 

for  regimes  E  and  W  respectively.  For  regime  E,  Barry 
et  al  (2001)  also  found  that 


Figure  1.  Schematic  diagram  of  the  front  view  of  the  ex¬ 
perimental  setup.  The  tank  is  520  mm  long,  and  was  filled 
with  a  linearly  salt  stratified  fluid.  The  vertical  rigid  grid 
stirred  the  entire  fluid  volume. 


rms  quantities)  such  that 

(>») 

where  Pr  =  v/k  is  the  Prandtl  number  of  the  fluid. 
This  rms  velocity  scaling  is  the  same  as  that  obtained 
by  using  the  direct  measurements  of  Kp  and  Lt  and 
solving  for  in  (1).  No  velocity  or  lengthscale  relations 
were  identified  in  regime  W. 

The  primary  focus  of  this  paper  is  to  compare  the 
Barry  et  al  (2001)  direct  diffusivity  measurements  in 
regime  E  with  those  inferred  (via  (4))  using  the  numer¬ 
ical  data  of  Ivey  et  al  (1998),  the  laboratory  data  of 
Stillinger  et  al  (1983b),  Rohr  (1985),  and  Itsweire  et 
al.  (1987),  and  the  field  data  of  Saggio  and  Imberger 
(2001).  We  also  use  the  same  data  sets  to  assess  the 
generality  of  the  Barry  et  al  (2001)  lengthscale  and  ve¬ 
locity  relations  in  the  same  regime. 

The  data  sets 


Lt  =  20 


(i/k)^/^ 


(9) 


where  is  the  convective  lengthscale,  Leo- 

This  lengthscale  was  related  to  Ut  using  (2)  (and  assum¬ 
ing  that  I  and  u  are  well  represented  by  their  respective 


Direct  numerical  simulations 

Using  the  code  described  by  Holt  et  al  (1992),  Ivey  et 
al  (1998)  undertook  a  series  of  direct  numerical  simula¬ 
tions  (DNS)  of  stratified  shear  flows  using  a  128^  point 
computational  domain  in  which  and  the  square  of 
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the  mean  vertical  shear  =  (dUfdz)^  were  held  con¬ 
stant.  Although  only  one  value  of  was  used  for 
all  the  simulations  discussed  here,  N^  was  varied  be¬ 
tween  simulations  such  that  the  Richardson  number, 
Ri  =  N^/ varied  from  0.075  to  1.0.  The  experiments 
were  run  for  non-dimensional  shear  time,  0  <  <  8, 

however  we  consider  the  results  from  only  St  >  2  where 
the  turbulence  becomes  self-adjusted  and  has  lost  mem¬ 
ory  of  the  initialisation  {Ivey  et  al,  1998).  At  each  time- 
step  within  a  simulation,  all  turbulent  quantities  were 
calculated  as  ensemble  averages  over  the  entire  fluid  vol¬ 
ume.  With  the  application  to  geophysical  flows  in  mind, 
we  consider  only  the  DNS  experiments  with  Pr  >  1. 

Laboratory  experiments 

The  experiments  of  Stillinger  et  al  (1983b),  Rohr 
(1985),  and  Itsweire  et  al  (1987)  measured  turbulent 
quantities  at  different  points  downstream  of  a  fixed  grid 
at  the  head  of  a  water  tunnel  (see  Stillinger  et  al,  1983a, 
for  a  full  description  of  the  experimental  configuration). 
These  experiments  used  a  variety  of  grids  with  bars 
of  diameter  0.318  to  0.635  cm  at  spacings  of  1.905  to 
3.81  cm.  In  all  the  experiments,  salt  stratified  water 
was  continually  recirculated  through  the  tunnel,  allow¬ 
ing  the  collection  of  timeseries  data  at  several  positions 
downstream  of  the  stationary  grid.  Average  turbulent 
quantities  were  computed  at  these  positions  by  aver¬ 
aging  the  collected  timeseries  at  each  point.  The  Rohr 
(1985)  experiments  included  mean  shear.  Prom  numeri¬ 
cal  simulations,  Itsweire  et  al  (1993)  found  that  in  these 
experiments,  e  was  underestimated  by  approximately  a 
factor  of  2.  We  have  applied  this  correction  factor  to 
the  data  sets  for  this  analysis. 

Field  experiments 

Saggio  and  Imherger  (2001)  measured  turbulent  prop¬ 
erties  in  the  metalimnion  of  Lake  Kinneret,  Israel,  dur¬ 
ing  three  consecutive  summer  periods.  During  these 
experiments,  a  portable  flux  profiler  (PFP)  capable  of 
resolving  all  three  turbulent  velocity  components  was 
traversed  vertically  through  the  water  column  at  ap¬ 
proximately  10  cm/s.  This  probe  also  allowed  the  mea¬ 
surement  of  the  instantaneous  vertical  advective  buoy¬ 
ancy  flux  p'ty',  and  the  rate  of  dissipation  of  turbulent 
kinetic  energy,  e. 


Results 

Turbulent  lengthscales  and  velocities 


Figure  2.  Root  mean  square  turbulent  velocity  Ut  vs 
in  cm/s;  •  ,  Stillinger  et  al  (1983b);  o  ,  Rohr 
(1985);  -h  ,  Itsweire  et  al  (1987);  *  ,  Ivey  et  al  (1998). 
The  solid  line  is  slope  1,  with  the  equation  given  by  (13). 


Ut  =  V'2(u'2)  +  (11) 

for  the  laboratory  data  and 

Ut  =  y/ -h  (12) 


for  the  DNS  data.  The  solid  line  in  Figure  2  is  slope  1 
and  has 

Ut  =  1.5(eLt)'/"  (13) 

consistent  with  Ivey  et  al  (1998).  Figure  3  shows  the 
normalised  turbulent  velocity  scale  against  efi/N^  for 
the  same  data,  where  the  solid  line  is  given  by 


Ut 

(i/Ar)V2pr"^/^^ 


(14) 


Equations  (13)  and  (14)  imply  that  the  rms  turbulent 
lengthscale  Lt  in  the  DNS  and  laboratory  data  is  de¬ 
scribed  by 

=  (15) 

Similarly,  Saggio  and  Imherger  (2001)  found  for  field 
data  in  a  heat  stratified  lake  that 


Ut 

(i/Ar)V2pr^-i/i2 


(16) 


In  Figure  2  we  plot  Ut  against  (eLt)^/^,  for  the  above 
numerical  and  laboratory  data  sets  where  efvN^  >  300, 
This  rms  quantity  was  computed  as 


and 

Lt  =  15Z/CO,  (^7) 

where  Ut  was  computed  using  (12). 
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Figure  3.  Normalised  txirbulent  velocity 

vs  efi/N^;  •  ,  Stillinger  et  al 

(1983b);  o  ,  Rohr  (1985);  +  ,  Itsweire  et  al  (1987);  ♦  ,  Ivey 
et  al.  (1998).  The  solid  line  is  slope  1/3,  with  the  equation 
given  by  (14). 

Buoyancy  flux  estimates  of  Kp 

In  Figure  4  we  plot  the  quantity  (6/7V’^)/(2/^/^k^/^) 
against  ejvN^,  where  b  =  {g/po^p^  is  the  buoyancy 
flux.  The  solid  lines  are  given  by 


- . ^  -04/  «  V''" 

Ar2(i/2/3«l/3) 

(18) 

for  the  laboratory  data  and 

h  /  e  xi/3 

(1/273^1/3)  ^\uN^) 

(19) 

for  the  DNS  data  where  >  300. 

Saggio  and  Imberger  (2001)  found  from  field 
Lake  Kinneret  that  when  eJuN^  >  36, 

data  in 

^  12  C  '  V'" 

jV2(j^2/3«l/3)  ■ 

(20) 

Discussion 

The  functional  forms  of  equations  (14),  (16)  and  (15), 
(17)  are  consistent  with  the  Barry  et  al  (2001)  scalings 
in  (10)  and  (9),  respectively.  In  particular,  (15)  and  (17) 
demonstrate  that  in  all  these  data  sets  the  turbulent 
lengthscale  is  not  a  function  of  the  rate  of  dissipation  of 


Figure  4.  vs  e/uN^;  •  ,  Stillinger  et  al 

(1983b);  o  ,  Rohr  (1985);  +  ,  Itsweire  et  al  (1987);  *  ,  Ivey 
et  al.  (1998).  The  equations  of  the  lines  are  given  in  the 
text. 


turbulent  kinetic  energy  when  e/i/N^  >  300.  Further, 
these  velocity  scales  are  consistent  with  the  results  of 
Barry  et  al  (2001)  who  suggested  that  for  a  given  N  in 
regime  E,  an  increase  in  e  does  not  result  in  an  increase 
in  the  turbulent  overturn  scale  but,  rather,  leads  to  an 
increase  in  the  magnitude  of  the  rms  turbulent  velocity. 

Although  we  note  some  variation  in  the  constant 
coefficient,  the  agreement  between  the  turbulent  ve¬ 
locity  and  lengthscale  relations  of  Barry  et  al  (2001) 
and  those  of  the  data  sets  presented  here  suggests  that, 
to  first  order,  the  configuration  of  the  turbulence  gen¬ 
eration  mechanism  does  not  influence  the  physics  of 
the  turbulence  when  e/i/N^  >  300.  Since  all  data 
sets  are  reasonably  well  described  by  Lt  ~  Leo  and 
Utl{{vNY/^Pr-^l^^)  ~  (e/i/iV2)V3^  we  assume  that 
(1)  also  applies  to  the  experiments  detailed  above  and, 
if  it  were  directly  measured,  Kp  would  be  well  described 
by  (7)  in  regime  E. 

Figure  4  shows  that  for  regime  E  there  is  good 
agreement  between  (7)  (given  by  the  solid  line)  and 
6/(iV’^2/^/^K^/^)  computed  using  the  laboratory  data. 
Although  there  is  some  scatter  in  the  laboratory  data 
(consistent  with  the  scatter  in  the  velocity  and  length- 
scale  measurements  shown  in  Figures  2  and  3),  the  trend 
in  is  consistent  with  (7)  in  regime  E.  We 

conclude  that  in  these  laboratory  experiments,  (4)  can 
be  used  to  satisfactorily  approximate  the  true  diffusiv- 
ity  measured  by  Barry  et  al  (2001)  when  >  300. 
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Figure  5.  vs  -  ,  StUhnger  et  al. 

(1983b);  o  ,  Rohr  (1985);  +  ,  Itsweire  et  al.  (1987).  The 
solid  line  is  with  Pr  =  700. 


The  quantity  b/{N^i>^/^K^/^)  computed  from  both 
DNS  and  Saggio  and  Imberger  (2001)  data  shows  the 
same  functional  dependence  on  {e/uN'^f/^  as  (7)  when 
e/vN"^  >  300.  This  implies  that  the  physics  of  the  tur¬ 
bulence  in  regime  E  (i.e.  the  relationship  between  Kp, 
Ut  and  Lt)  is  well  described  by  the  sceile  forms  of  the 
Barry  et  al.  (2001)  results.  We  note,  however,  that  the 
constant  coefficients  in  the  DNS  and  field  data  sets  are 
different  from  those  of  the  Barry  et  al.  (2001)  measure¬ 
ments.  It  is  currently  not  clear  why  this  should  be  the 
case. 

Figure  5  shows  the  estimates  of  Kp  from  the  labora¬ 
tory  data  sets  using  (4)  compared  with  the  prediction 
from  Osborn  (1980)  in  (5)  for  Pr  =  700.  The  figure 
shows  that  for  e/i/N^  >  300,  the  model  predictions 
from  (4)  and  (5)  diverge.  For  highly  energetic  turbu¬ 
lence,  the  two  estimates  are  widely  disparate,  such  that 
at  efvN^  «  10®  they  are  different  by  more  than  two 
orders  of  magnitude. 

Barry  et  al.  (2001)  argued  that  the  most  likely  cause 
of  this  divergence  for  large  e/vN"^,  is  the  selection  of  a 
constant  mixing  efficiency  Rj  in  deriving  (5).  Ivey  and 
Imberger  (1991)  described  the  behaviour  of  this  mixing 
efficiency  in  terms  of  the  turbulent  Proude  number. 


,  (21) 


and  the  turbulent  Reynolds  number, 

(22) 

where  Lg  =  (e/AT®)^/^  is  the  Ozmidov  scale,  Lc  is  a 
centred  displacement  scale  equivalent  to  Lt,  and  L^  = 
(i/®/e)^/'*  is  the  Kolmogorov  scale.  In  particular,  Ivey 
and  Imberger  (1991)  found  that  Rf  has  a  maximum 
of  approximately  0.2  at  Frx  w  1.2  (or  Lg  «  Lt),  and 
that  Rf  decreases  rapidly  when  Frr  deviates  from  this 
value,  such  that  for  Frr  >  1.2, 


Rf  = 


1 

1  -b  3Fr2 


(23) 


for  large  Rex-  If  we  assume  that  Lt  (and  hence  Lc)  is 
well  described  by  27.5ico  for  e/vN"^  >  300  (where  the 
coefficient  of  27.5  is  a  representative  value  from  the  Still- 
inger  et  al.  (1983b),  Rohr  (1985),  Itsweire  et  al.  (1987) 
and  Barry  et  al.  (2001)  data  sets),  we  can  substitute 
27.5Tco  for  Lc  in  (21)  and  (22)  to  give 


(24) 

=  (25) 

respectively.  Ivey  and  Imberger  (1991)  argued  that  any 
two  of  the  non-dimensional  parameters  Fry,  Rer  and 
e/vN^  are  necessary  and  sufficient  to  describe  the  dy¬ 
namics  of  a  general  turbulent  field.  However,  equations 
(24)  and  (25)  show  that  for  regime  E  in  the  above  ex¬ 
periments  we  need  only  one. 

We  plot  Frr  against  ejuN^Pr^^^  in  Figure  6,  for 
efvN^  >  300,  where  the  solid  line  is  given  by  (24). 
Equations  (23)  and  (24)  show  that  Rf  necessarily  de¬ 
creases  with  increasing  e/z/AT^, 


l+0.0A{^f'^Pr^/^' 

Figure  7  shows  Rf  against  e/vN^  for  these  data  sets, 
and  clearly  demonstrates  that  a  diffusivity  mndHled  as 
Kp  =  Rf/  {1-Rf)e/N^  with  Rf  constant  at  0.15  (shown 
in  Figure  7  as  a  horizontal  solid  line)  will  overestimate 
the  true  diffusivity  at  large  e/i/AT^  in  these  experiments. 
This  result  is  consistent  with  the  divergence  of  the  es¬ 
timates  of  Kp  from  (4)  and  (5)  shown  in  Figure  5. 


Application  to  geophysical  flows 

Equations  (24)  and  (26)  hold  if,  and  only  if,  the  tur¬ 
bulent  lengthscale  Lt  is  well  described  by  Lgg.  Sev¬ 
eral  field  investigations  however,  (e.g.  Crawford,  1986; 
Gregg,  1987;  Peters  et  al.,  1988;  Imherger  and  Ivey, 
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Figure  6.  Turbulent  Proude  number  Frr  vs 
■  ,  Stillinger  et  al.  (1983b);  o  ,  Rohr 
(1985);  +  ,  Itsweire  et  al.  (1987);  A  ,  Barry  et  al.  (2001);  x  , 
Perron  et  al.  (1998).  The  solid  line  is  . 


^  * . . . 1  . 1 
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Figure  7.  Rf  vs  e/i/iV^;  •  ,  Stillinger  et  al.  (1983b);  o  ,  Rohr 
(1985);  +  ,  Itsweire  et  al.  (1987);  A  ,  Barry  et  al.  (2001). 


1991;  Peters  et  al..,  1995;  Perron  et  al..,  1998)  have  found 
that  in  some  geophysical  flows  Lt  ^  Lo,  or  equivalently, 
that  Frx  ^  1-  In  these  situations,  this  relationship  be¬ 
tween  Lt  and  Lq  has  been  found  to  hold  regardless  of 
the  value  of  e/uN’^.  For  instance,  Perron  et  al.  (1998) 
observed  that  in  the  Romanche  fracture  zone, 

Lo  =  0,95(±0.6)Lt,  (27) 

even  when  e/uN^  ^  500, 000,  where  Lt  is  the  Thorpe 
displacement  scale  and  is  a  measure  of  Lt.  The  corre¬ 
sponding  value  of  Prx  is  order  1  and  is  shown  in  Fig¬ 
ure  6  as  an  x.  The  Perron  et  al.  (1998)  measurements 
clearly  do  not  conform  to  the  trends  of  the  data  sets 
previously  considered  here,  fundamentally  because  Lt 
does  not  scale  like  Leo- 

The  measurements  of  Perron  et  al.  (1998)  imply  that 
Prr  is  always  approximately  unity,  regardless  of  the 
magnitude  of  efi/N^.  The  arguments  of  Ivey  and  Im- 
berger  (1991)  then  require  that  the  mixing  efliciency  in 
the  Romanche  fracture  zone  is  constant  and  maximal, 
and  hence  the  large  value  of  efuN^  cannot  be  used  in 
isolation  to  infer  that  Rj  is  vanishingly  small.  Rather, 
either  Frx  or  Rex  must  also  be  included  in  a  description 
of  the  turbulent  dynamics.  In  such  cases,  the  scaling  re¬ 
sults  of  Barry  et  al.  (2001)  should  not  be  expected  to 
hold. 

Conclusions 

We  have  compared  the  turbulent  lengthscale,  ve¬ 
locity  and  diffusivity  scalings  presented  by  Barry  et 
al.  (2001)  with  data  from  Ivey  et  al.  (1998)  DNS  ex¬ 
periments,  laboratory  experiments  of  Stillinger  et  al 
(1983b),  Rohr  (1985),  and  Itsweire  et  al.  (1987)  and 
the  field  experiments  of  Saggio  and  Imberger  (2001). 
For  efi/N^  >  300,  the  Barry  et  al.  (2001)  velocity  and 
lengthscale  results  are  good  descriptors  of  the  physics 
of  the  turbulence  in  all  these  experiments,  to  constant 
coefficient.  We  have  shown  that  the  turbulent  length- 
scales  in  all  these  experiments  in  this  regime  are  inde¬ 
pendent  of  the  rate  of  dissipation  of  turbulent  kinetic 
energy.  This  in  turn  suggests  that  for  a  given  AT,  an  in¬ 
crease  in  e  does  not  increase  the  turbulent  lengthscale, 
but  rather,  leads  to  an  increase  in  the  magnitude  of  the 
turbulent  velocity. 

We  have  shown  that  when  e/i/AT^  >  300,  a  diffusivity 
modelled  using  (4),  is  a  good  representation  of  the  direct 
measurements  of  Barry  et  al.  (2001)  for  the  experiments 
considered,  to  constant  coefficient. 

We  have  discussed  the  parameterisation  of  (4)  as 
Kp  =  0.2e/iV^  and  have  shown  that,  in  the  data  sets 
considered,  this  overestimates  the  true  diffusivity  for 
>  300.  We  have  suggested  that  this  is  a  conse¬ 
quence  of  selecting  a  constant  mixing  efficiency  i?/  = 
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0.15,  as  for  increasing  e/vN^,  Rf  necessarily  decreases 
in  the  data  sets  considered. 

Finally,  we  have  discussed  the  application  of  the 
Barry  et  al  (2001)  scalings  to  geophysical  flows  and 
have  suggested  that  a  necessary  requirement  for  the 
use  of  these  scalings  is  that  the  tiurbulent  lengthscale 
Lt  is  independent  of  e.  If  this  constraint  is  satisfied 
(i.e.  Lt  ~  Leo)  then  Kp,  and  indeed  all  the  turbulent 
dynamics,  can  be  described  in  terms  of  only  a  single  pa¬ 
rameter  When  this  lengthscale  constraint  is  not 

satisfied,  the  Barry  et  al.  (2001)  scalings  do  not  apply. 
Future  work  must  now  investigate  the  circumstances 
under  which  this  lengthscale  constraint  is  satisfied  in  a 
variety  of  geophysical  flows. 
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Abstract.  An  investigation  into  density  and  velocity  fluctuations  in  the 
oceanic  thermocline  is  presented.  Two  kinds  of  numerical  simulation  are 
reported.  In  the  first,  an  attempt  is  made  to  capture  the  transition  from 
breaking  internal  waves  to  the  small-scale  turbulence  they  generate.  The 
model  used  for  this  is  based  on  a  continual  forcing  of  a  large-scale  standing 
mternal-wave.  Evidence  is  presented  for  a  transition  in  the  energy  spectra 
from  the  anisotropic  k  ^  buoyancy  range  to  the  small-scale  isotropic 
mertial  range.  Density  structures  that  form  during  wave  breaking  are  analyzed 
and  regions  of  mixing  associated  with  the  breaking  events  are  visualized.  In 
the  second  kind  of  simulation,  internal-wave  packets  are  followed  as  they 
propagate  through  the  thermocline.  It  is  found  that  the  breaking  of  crests 
within  the  packet  can  lead  to  overturning  events  on  the  scale  observed  in  the 
ocean,  and  the  subsequent  turbulence  can  form  a  continuous  wake. 


Introduction 

In  recent  observations  of  fluctuations  in  the  oceanic 
thermocline,  Alford  and  Pinkel  (1999,  2000)  found  many 
overturns  with  vertical  scale  of  about  2  m  and  these 
were  highly  correlated  with  the  presence  of  energetic 
waves  with  vertical  wavelengths  on  the  order  of  10  m. 
Large  scale  fluctuations,  say  10  m  and  above  in  ver¬ 
tical  scale,  can  be  described  reasonably  well  as  inter¬ 
nal  waves.  For  much  smaller  scales,  say  1  m  and  be¬ 
low,  the  flow  is  probably  better  described  in  terms  of 
nearly  isotropic  tmbulence.  Intermediate  between  the 
large-scale  wave  dynamics  and  the  small-scale  turbu¬ 
lence  is  a  transition  regime  in  which  there  is  a  com¬ 
petition  between  waves  and  turbulence.  It  is  this 
intermediate  range,  often  called  the  buoyancy  range, 
that  contains  the  overturning  activity  observed  by  Al¬ 


ford  and  Pinkel.  Since  the  observations  are  essen¬ 
tially  one-dimensional  in  space,  a  direct  numerical  sim¬ 
ulation  which  could  faithfully  describe  events  in  this 
r^ge  would  help  toward  understanding  the  full  three- 
dimensional  flow  structures  behind  the  observations. 
The  first  part  of  oin  investigation  will  focus  on  the 
production  of  overturns  by  an  idealized  internal  wave 
forcing. 

The  second  part  of  our  investigation  concerns  the 
propagation  of  internal-wave  packets  through  the  ther¬ 
mocline.  Alford  and  Pinkel  (1999,  2000)  observed  co¬ 
herent  regions  of  strong  oscillatory  vertical  strain  rate 
that  travel  vertically  through  100  m  or  more  of  the  ther¬ 
mocline.  These  propagating  structures  had  an  internal 
wave  structure  with  vertical  wavelength  of  about  10  m, 
and  the  entire  coherent  region  could  be  described  by 
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an  envelope  of  about  50  m  in  vertical  extent.  These 
coherent  structures  were  strongly  correlated  with  2  m 
overturns.  Given  the  complicated  nature  of  the  flow, 
with  strong  components  of  large-scale  advection,  it  was 
difficult  to  know  precisely  what  kind  of  structures  to 
associate  with  these  coherent  localized  oscillations,  Al¬ 
ford  and  Pinkel  suggested  that  these  were  internal- wave 
packets.  Here  we  hope  to  validate  that  identification 
to  a  certain  extent  by  showing  how  an  idealized  packed 
would  propagate  through  a  simplified  model  of  the  ther- 
mocline,  and  by  showing  that  the  observed  overturn¬ 
ing  scales  could  be  consistent  with  internal- wave  packet 
propagation. 

Given  current  computer  resources,  it  would  not  be 
practical  to  simulate  all  of  the  scales  that  are  relevant 
to  the  breaking  of  waves  in  the  buoyancy  range.  The 
forcing  of  the  waves  is  thought  to  result  from  a  com¬ 
plicated  interaction  of  many  internal  waves  with  scales 
ranging  in  the  vertical  from  lO’s  of  meters  to  hundreds 
of  meters  and  more,  and  typical  horizontal  internal  wave 
scales  can  be  even  much  greater  than  these.  On  the 
small-scale  end  of  the  spectrum,  the  observed  break¬ 
ing  is  occurring  on  vertical  scales  of  the  order  of  1  m, 
and  these  breaking  events  produce  turbulence  that  ex¬ 
tends  down  to  a  viscous  cutoff  on  the  order  of  1  cm. 
Thus  direct  numerical  simulation  of  the  entire  range  of 
scales  is  still  impractical.  We  will  use  a  combination 
of  LES  modehng  and  an  artificial  model  of  the  large 
scale  forcing  in  order  to  reduce  the  spectral  range  that 
we  will  need  to  cover.  To  attack  our  first  problem  of 
investigating  how  waves  at  the  short-scale  end  of  the 
Garrett-Munk  (1975)  spectrum  go  unstable  and  break 
in  the  buoyancy  range,  we  have  used  an  artificial  forcing 
with  length-scales  fixed  at  20  m  in  the  vertical  and  20 
m  in  the  horizontal  to  represent  the  effect  of  all  larger 
scales.  At  the  small-scaJe  end  of  the  simulation,  we  have 
introduced  an  eddy  viscosity  with  a  cutoff  at  the  16  cm 
level  in  both  horizontal  and  vertical  directions.  Thus 
our  model  does  some  violence  to  the  true  physics  at 
the  large  and  small-scale  ends  of  the  simulated  range. 
However,  the  hope  is  that  it  will  do  justice  to  the  evo¬ 
lution  in  the  buoyancy  range.  This  model  does  prove 
capable  of  capturing  the  transition  from  the  buoyancy 
to  the  inertial  range.  For  the  problem  of  wave-packet 
propagation,  which  is  perhaps  the  source  of  the  order 
10  m  scale  variability  most  correlated  with  overturning, 
we  needed  to  expand  our  domain  size  in  order  to  allow 
for  the  propagation  and  evolution  of  the  packet. 

Forced  20-meter  wave 

Our  first  goal  is  to  determine  to  what  extent  our 
simulations  can  capture  the  transition  from  the  buoy¬ 
ancy  range  to  the  inertial  range  in  the  energy  spec¬ 


tra.  Constructing  a  theory  of  this  transition  is  compli¬ 
cated  because  of  the  anisotropy  of  the  buoyancy  range. 
To  make  progress,  some  theoretical  formulations  have 
represented  the  entire  spectrum  as  depending  only  on 
wavenumber  k.  The  model  for  the  kinetic  energy  spec¬ 
trum  in  the  buoyancy  range  is  then 


E(k)  =  aN^k 


2 1,-3 


(1) 


where  a  is  an  empirical  constant  and  N  is  the  Brunt- 
Vms^a  frequency,  which  measures  the  strength  of  the 
stratification.  The  Brunt- Vaisala  frequency  is  defined 
by 


po  dz 


(2) 


where  g  is  the  acceleration  of  gravity,  p  is  the  back¬ 
ground  density  profile,  assumed  stable  (i.e.  dp/dz  <  0), 
and  Po  is  the  volume  average  of  p.  Prom  the  observed 
spectra  of  vertical  shear,  the  constant  a  is  determined 
to  be  about  0.47,  but  it  will  be  more  convenient  for  us 
to  consider  the  two  components  of  the  horizontal  veloc¬ 
ity  (u,  u)  separately,  and,  assuming  horizontal  isotropy 
in  the  observations,  this  would  suggest  a  «  0.2  for  the 
spectrum  of  either  component  (cf.  Gibson  1986,  Gargett 
et  0,1.  1981).  The  inertial  range  kinetic  energy  spectrum 
is  given  by 

E{k)  =  (3) 

where  e  is  the  turbulent  dissipation  rate  of  total  kinetic 
energy  and  Ck  is  the  empirical  Kolmogorov  constant. 
A  reasonable  value  to  assume  for  Ck  is  1.5  (cf.  Lesieur, 
1997).  For  the  energy  of  one  component  of  the  velocity 
field,  there  would  simply  be  a  prefactor  of  1/3  multi¬ 
plying  this  isotropic  spectrum.  The  Ozmidov  (or  buoy¬ 
ancy)  wavenumber  is  then  estimated  by  simply  match¬ 
ing  these  two  spectra  at  wavenumber  kb-  The  result  is, 
up  to  an  order  one  multiplicative  constant. 


kb  =  ^/I^€, 


(4) 


(cf.  Holloway,  1981;  Gibson  1986). 

The  model  for  the  potential  energy  spectrum  in  the 
buoyancy  range  is  similar  to  that  for  the  kinetic  en¬ 
ergy  spectrum.  The  empirical  constant  a  for  the  tem¬ 
perature  spectrum  is  found  to  have  value  of  about  0.2 
(cf.  Gibson  1986,  Gregg  1977).  The  spectral  model  for 
the  inertial  range  of  density  fluctuations  is  the  Corrsin- 
Obukhov  spectrum,  which  involves  the  decay  rate  of 
density  fluctuations  as  well  as  e.  For  our  purposes,  we 
prefer  to  write  the  spectrum  directly  in  terms  of  the 
turbulent  decay  rate  of  potential  energy,  which  we  shall 
write  as  Cp^.  Then  the  Corrsin-Obukhov  spectrum  for 
the  potential  energy  takes  the  following  form: 

PE{k)  =  C,ep,e-^/^k-^f\ 


(5) 
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where  Co  is  the  Corrsin  constant. 

For  all  of  the  simulations  presented  here,  we  have 
used  the  Boussinesq  approximation.  The  evolution 
equations  may  be  written  as 

|  +  u.Vu+lvp'-^g  =  .(V^)VV  (6) 

V  •  u  =  0.  (7) 

^  +  u.Vp'  +  u;|^  =  k(V2)VV,  (8) 

where  i/(-)  and  «(•)  are  considered  functions  of  the 
Laplacian  operator  and  are  used  to  represent  eddy  pa- 
rameterizations  in  general  (cf.  Herring  and  Metals 
1992)  and  g  =  — pz.  We  have  neglected  the  effect  of  ro¬ 
tation,  which  should  not  play  a  major  role  at  the  small 
scales  with  which  we  are  concerned.  The  total  density 
is  given  by 

P  =  p{z)+p\x,y,z,t),  (9) 

where  p\x,y^z,t)  is  the  deviation  from  the  horizona- 
tally  averaged  density  p{z).  po  is  the  average  of  p(z) 
over  z.  The  pressure  p'  is  the  deviation  from  the  back¬ 
ground  mean  pressure,  p'  can  be  determined  in  terms  of 
u  by  taking  the  divergence  of  (6)  imder  the  assumption 
that  the  velocity  field  is  divergenceless. 

We  simulate  these  dynamical  equations  with  a  spec¬ 
tral  code  with  triply  periodic  boundary  conditions.  As  a 
sub-grid  scale  parameterization,  we  have  used  the  large- 
eddy  simulation  model  of  Lesieur  and  Rogallo  (1989). 
This  eddy  viscosity  Vt{k)  is  approximately  constant 
throughout  the  buoyancy  range  and  the  large-scale  end 
of  the  inertial  range,  but  increases  rapidly  with  k  in  the 
vicinity  of  the  spectral  cutoff  kc-  Due  to  the  spectral 
shape  of  the  eddy  viscosity,  this  model  is  sometimes 
called  the  cusp  model.  It  seems  reasonable  in  modeling 
the  buoyancy  range  to  use  such  a  model  since  it  does 
not  completely  neglect  the  effects  of  imresolved  eddies 
on  the  buoyancy  range,  but,  at  the  same  time,  it  puts 
the  strongest  eddy  viscosity  in  the  inertial  range  near 
the  cutoff. 

We  should  emphasize  the  point  that  the  size  of  the 
eddy  viscosity  depends  on  the  amount  of  energy  at  the 
cutoff  scale.  If  the  resolution  of  the  simulation  of  a  given 
physical  flow  is  increased,  that  is  if  kc  is  increased,  then 
the  eddy  viscosity  will  be  correspondingly  smaller.  The 
total  viscosity  used  in  the  simulations  is  the  sum  of 
the  eddy  viscosity  and  the  constant  molecular  viscosity 
I'moh  Thus  the  z/(V^)  in  equation  (6)  in  the  spectral 
simulation  is  taken  as  the  total  viscosity: 

Z/(fc)  =  ^mol  H”  ^t(fc)  (^^) 

The  choice  of  turbulent  diffusion  depends  on  the 
choice  of  values  for  various  parameters  that  enter  into 


the  closure  model  for  stratified  turbulence.  For  simplic¬ 
ity,  we  have  just  taken  the  turbulent  Prandtl  number 
Prt{k)  to  be  a  fixed  constant  independent  of  k  in  our 
simulations.  We  determined  this  constant  by  examin¬ 
ing  the  evolution  of  the  potential  energy  spectrum  for 
decaying  stratified  turbidence  that  is  initially  highly  ex¬ 
cited  at  all  scales.  More  specifically,  we  started  with  an 
initial  spectrum  in  which  the  GM  spectrum  was  contin¬ 
ued  to  scales  below  10  m  as  in  the  decay  simulations 
of  Siegel  and  Domaradzki  (1994).  With  Prt  =  0.55 
our  simulations  of  decaying  turbulence  produced  spec¬ 
tra  with  the  high  wavenumbers  obeying  the  law 
for  both  velocity  and  density  fluctuations. 

Next  we  turn  to  the  question  of  the  forcing.  The 
large-scale  flows  that  actually  drive  the  buoyancy  range 
are  predominantly  the  waves  of  the  Garrett-Munk  range. 
The  full  range  where  internal  wave  dynamics  dominates 
includes  scales  of  kilometers  in  the  horizontal  and  hun¬ 
dreds  of  meters  in  the  vertical.  Because  of  lack  of  reso¬ 
lution,  we  cannot  provide  a  full  representation  of  the 
effects  of  all  large-scale  internal  wave  forcing  on  the 
buoyancy  range.  In  our  model,  of  necessity,  we  perform 
a  drastic  reduction  in  modeling  the  forcing;  we  replace 
the  driving  of  all  of  the  GM  waves  by  a  linear  stand¬ 
ing  wave  at  one  wavelength.  Bouruet-Aubertot  et  al 
(1995,  1996),  in  two-dimensional  simulations  of  a  strat¬ 
ified  turbulence,  excited  a  standing  wave  of  the  type 
we  use,  but  they  allowed  this  wave  to  decay,  whereas 
we  maintain  its  amplitude  at  the  same  level  throughout 
the  simulation. 

To  give  the  form  of  the  forcing  used,  let  us  first  in¬ 
troduce  nondimensional  units.  We  will  take  all  lengths 
to  be  scaled  by  27r/L,  where  L  is  the  length  of  one  side 
of  our  computational  domain.  Time  will  be  scaled  by 
1/N.  Then  the  frequency  of  linear  internal  waves  is 
given  by 

^  =  (11) 

where  kh  =  “^he  horizontal  wavenumber. 

One  particular  linear  standing  wave  is 


u  =  {u,v,w)  =  A-^(0,sinysin2;,cosycosz)sin  — , 
v2  ^/2 

,  (12) 

P  a  ^ 

—  =  A  cos  2/ COS  2:  cos (13) 
Po  V2 

where  A  is  an  arbitrary  amplitude  and  is  the  nondi¬ 
mensional  gravity.  Note  that  the  dimensional  period  of 
this  wave,  which  is  the  forcing  period,  is  given  by 


(14) 


To  give  some  idea  of  the  structure  of  this  standing 
wave,  we  show  in  Figure  1  a  contour  plot  of  the  density 
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Figure  1.  Contour  plot  of  the  density  field  in  a  vertical 
y-z  cross  section  through  the  center  of  the  domain.  The 
width  and  height  of  the  cross  section  are  each  20  m, 
which  corresponds  to  27r  in  nondimensional  units.  The 
instant  shown  corresponds  to  maximum  displacement 
of  the  isopycnals  for  the  forced  standing  wave. 


field  in  a  vertical  y-z  cross  section.  Note  that  the  den¬ 
sity  field  in  this  standing  wave  has  no  variation  along 
the  X  direction.  In  this  figure,  we  see  an  instantaneous 
representation  of  the  iso-density  surfaces.  When  t /  is 

an  odd  multiple  of  7r/2,  these  isopycnals  will  all  be  flat. 
The  degree  to  which  they  deviate  from  that  at  other 
times  depends  on  the  value  of  A  as  well  as  t.  The  instant 
of  time  represented  here  is  such  that  t/y/2  is  an  integer 
multiple  of  tt  and,  hence,  one  of  maximum  distortion  of 
the  density  contours.  Note  that  the  density  field  in  (13) 
has  two  nodal  planes,  represented  by  two  thick  contour 
lines  in  the  figure,  at  z  =  ±7r/2  (nondimensional).  Dur¬ 
ing  the  forcing  cycle,  these  planes  remain  flat  and  fixed 
in  position.  The  fluid  above  and  below  these  planes  ver¬ 
tically  approaches  and  retreats  from  them  depending  on 
the  phase  in  time  and  the  y  position  considered.  Thus 
the  points  on  the  nodal  planes  at  y  =  0  and  y  =  ±7r 
are  the  centers  of  regions  of  oscillating  high  strain  rate. 
On  the  other  hand  the  points  where  the  isopycnals  are 
steepest,  that  is  where  y  =  ±7r/2,  ±37r/2  and  z  =  0,±n 
the  magnitude  of  the  shear  dvjdz  is  highest.  Thus  one 
advantage  of  the  standing  wave  forcing  is  that  the  points 
of  highest  shear  and  highest  strain  rate  remain  fixed  in 
space  making  it  easier  to  differentiate  the  kinds  of  over¬ 
turning  events  associated  with  these  extremes.  This  wdll 
be  convenient  for  making  comparisons  with  Alford  and 


PinkePs  (2000)  analysis.  In  particular,  they  noted  that 
there  were  regions  of  high  shear  where  classical  shear 
instabilities  often,  but  not  always,  resulted  in  overturn¬ 
ing.  Even  more  more  interestingly,  there  were  regions  of 
high  Ri  in  which  overturns  were  also  observed.  In  more 
than  half  of  these  cases  Ri  was  even  greater  than  2  sug¬ 
gesting  that  the  typical  shear  instability  (Ri  <  1/4)  is 
unlikely.  Many  of  these  overturns  were  in  regions  of  high 
vertical  strain  rate.  With  our  standing  wave  forcing,  the 
regions  of  high  shear  and  high  strain  rate  are  separate 
and  each  occur  in  the  same  location  during  each  forcing 
oscillation.  This  helps  simplify  the  analysis. 

In  two-dimensional  numerical  studies  of  Bouruet- 
Aubertot  et  al.  (1996),  the  standing  wave  becomes  un¬ 
stable  and  generates  turbulence.  This  would  also  hap¬ 
pen  in  our  three-dimensional  simulation,  but  the  tur¬ 
bulence  would  be  highly  constrained  since  there  is  yet 
no  source  of  x-variation  in  our  flow.  To  break  the  two- 
dimensional  symmetry  of  the  flow,  while  maintaining 
the  basic  structure  of  the  large  scale,  we  add  a  weak 
component  of  forcing  with  x-variation.  We  have  tried 
this  in  various  ways:  adding  a  random  initial  pertur¬ 
bation  at  all  scales,  randomly  forcing  the  modes  with 
fc  =  1  at  each  time  step,  adding  another  large-scale 
standing  wave,  adding  a  propagating  wave,  and  so  on. 
The  results  are  similar  to  each  other  if  the  perturbations 
are  sufficiently  weak.  For  the  simulations  discussed  be¬ 
low,  we  have  added  to  the  primary  forcing  wave  only  a 
small  amplitude  standing  wave  of  the  same  spatial  scale. 
Specifically,  we  added  the  following  perturbation: 


—  ^^;^(cos(x  +  z),0,  -cos(x  -t-  z))sin 


(15) 


—  =  A'  cos(x  -f  z)  cos  -t=. 
Po  '  ^/2 


(16) 


Thus  in  the  simulations  discussed  below  the  forcing  oc¬ 
curs  only  at  fc  =  y/2.  The  coefficient  A'  was  taken  to 
be  AI2Q,  and,  hence,  the  energy  in  the  perturbation  is 
only  1/400  that  of  the  primary  forcing  wave. 

We  performed  a  series  of  experiments  in  which  the 
size  of  the  computational  domain  and  the  amplitude  of 
the  forcing  were  varied.  The  initial  studies  were  at  res¬ 
olution  64^  and  showed  that  for  sufficiently  large  am¬ 
plitudes  A  for  which  the  forcing  wave  itself  was  over¬ 
turning,  &  spectrum  extending  over  most  of  the 

spectral  range  could  be  established.  For  weaker  forc¬ 
ing,  a  steeper  spectrum  approximating  k~^  was  found 
{Camevale  and  Briscolini,  1999).  For  intermediate  am- 
phtude  forcings,  we  were  able  to  observe,  at  least  inter¬ 
mittently,  cases  which  do  appear  to  exhibit  the  transi¬ 
tion  from  the  buoyancy  range  to  the  smaller  scale  in¬ 
ertial  range.  Weak  and  strong  forcings  are  measured 
relative  to  shear  amplitudes  typical  in  the  thermocline. 
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The  best  results  were  obtained  with  a  forcing  ampli¬ 
tude  that  could  actually  be  considered  representative 
of  wave  amplitudes  in  the  thermocline.  Specifically,  the 
forcing  amplitude  that  we  refer  to  as  intermediate,  is  for 
a  value  of  A  in  equations  (12)  and  (13)  such  that  the 
maximum  shear  during  a  cycle  of  the  forcing  is  equiva¬ 
lent  to  the  rms  shear  of  the  GM  spectrum  at  the  scale  of 
our  computational  domain.  The  rms  shear  is  calculated 
by  integrating  the  shear  of  the  GM  spectrum  from  the 
kilometer  scale  down  to  the  scale  of  interest  (cf.,  Gregg, 
1989).  Our  best  results  tended  to  be  for  cases  in  which 
the  vertical  wavelength  of  the  forcing  was  20  m.  For 
iV  =  3  cph,  the  net  rms  shear  from  the  GM  spectrum 
for  this  scale  is  SgmC^O  m)w  3  x  10“®s“^  (cf.,  Gregg, 
1989).  Taking  this  value  to  determine  the  amplitude 
of  our  forcing,  we  obtain  a  standing  wave  in  which  the 
largest  deviation  of  the  density  isosurfaces  are  as  illus¬ 
trated  in  Figure  1.  Thus  we  have  a  standing  wave  that 
does  not  itself  overturn  during  the  forcing  cycle,  and, 
in  addition,  the  Richardson  number  of  the  forcing  wave, 
defined  by 


po  dz 


(17) 


does  not  drop  below  3.125.  Therefore,  the  forcing  wave 
itself  is  convectively  stable  and  not  subject  to  shear  in¬ 
stability.  This  kind  of  forcing  is  consistent  with  the  pic¬ 
ture  that  the  GM  waves  themselves  are  not  convectively 
or  shear  unstable,  but  through  wave- wave  interactions 
will  produce  smaller  scale  waves  that  are  unstable  by 
these  criteria.  Choosing  a  stronger  forcing  wave  that 
is  itself  convectively  or  shear  unstable  would  miss  the 
important  cascade  process  that  produces  the  unstable 
waves  of  the  buoyancy  range,  but  would  rather  produce 
turbulence  directly  resulting  in  an  inertial  range  (cf. 
Camevale  and  Briscoliniy  1999). 

For  all  of  the  simulations  discussed  below,  we  used 
a  resolution  128^  and  a  computational  cube  of  20  m  on 
a  side.  Our  isotropic  spectral  cutoff  is  at  wavenumber 
60,  and  the  smallest  resolved  wavelength  is  about  33 
cm  (with  grid  spacing  20  m/128  16  cm).  The  forc¬ 

ing  amplitude  was  fixed  so  that  the  max  shear  in  the 
forced  wave  is  jSga4‘(20  m),  and  the  Vaisala  frequency 
was  taken  to  be  3  cycles  per  hour,  which  is  a  typical 
oceanic  value. 


A  long  simulation  was  performed  with  realistic  values 
for  the  molecular  viscosity  and  diffusivity.  The  kine¬ 
matic  viscosity  was  set  to  Umoi  =  O.Ol  cm^/s  and  the 
molecular  Prandtl  number  at  Ptmoi  =  7  (cf.  Gargett, 
1985).  We  can  calculate  a  Reynolds  number  for  the 
oceanic  flow  for  vertical  motions  on  the  20  m  scale  by 
using  the  rms  shear.  Thus  we  can  write 


Re  —  SGMiL)L^ /i/frioi*  (18) 


For  L  =  20  m,  this  Reynolds  number  would  be  ap¬ 
proximately  10^.  By  including  the  molecular  viscosity, 
the  simulation  is  an  attempt  to  represent  flow  with  this 
Reynolds  number.  We  will  see  that  there  is  not  much 
difference  with  results  obtained  by  neglecting  the  molec¬ 
ular  components  of  viscosity  and  diffusivity.  That  is  to 
say  that  over  the  range  of  scales  simulated  (20  m  to  33 
cm)  the  difference  between  infinite  Reynolds  number 
flow  and  that  for  Re  =  10^  is  small. 

We  can  think  of  our  standing  wave  forcing  as  the 
linear  susperposition  of  a  set  of  propogating  internal 
waves.  To  be  precise,  the  combination  of  the  two  stand¬ 
ing  waves  given  in  (12)  and  (15)  consists  of  12  propagat¬ 
ing  plane  waves.  These  wave  interact  nonhnearly  pro- 
ducins  smaller-scales  that  eventually  fill  out  the  entire 
spectrum.  The  early  evolution  is  essentially  just  that  of 
the  nearly  two-dimensional  standing  wave.  During  this 
time  there  are  only  sinusoidal  waves  on  the  most  dis¬ 
turbed  isosurface,  but  these  waves  then  fold  over  form¬ 
ing  elongated  overturns.  These  regions  are  convectively 
unstable  and  break.  At  this  point  the  three  dimension¬ 
ality  of  the  flow  becomes  evident. 

After  about  five  cycles  of  the  forcing,  the  large-scale 
wave  breaks  repeatedly,  however,  not  necessarily  during 
each  forcing  period.  The  wave  breaking  on  the  most 
disturbed  isopycnal  occurs  roughly  symmetrically  with 
large-scale  overturning  occurring  nearly  at  the  same  val¬ 
ues  of  y  and  z  each  time  and  along  lines  of  constant  x, 
respecting  in  the  large  scales  the  symmetry  of  the  main 
part  of  the  forcing.  However,  no  two  breaking  events 
with  the  subsequent  evolution  during  the  forcing  cy¬ 
cle  are  the  same.  In  Figure  2,  for  one  such  cycle,  we 
show  eight  instantaneous  images  of  this  isosurface  us¬ 
ing  a  perspective  three-dimensional  plot.  The  frames 
are  ordered  temporally  from  left  to  right  and  top  to 
bottom.  The  first  frame  in  the  upper  left  hand  cor¬ 
ner  corresponds  to  t  =  llMTp,  and  the  interval  be¬ 
tween  frames  is  At  =  Tp/7.  Thus  the  first  and  last 
frames  correspond  to  the  same  phase  of  the  forcing. 
The  first  frame  captures  the  moment  when  breaking 
is  just  beginning.  Let  us  say  that  the  first  four  frames 
represent  the  breaking  event,  and  the  last  four  the  af¬ 
termath.  We  see  that  during  the  breaking  event,  heavy 
fluid  spills  over  lighter  fluid,  crashing  down  with  the  cre¬ 
ation  of  small-scale  structures  all  along  the  lines  of  the 
two  breaking  regions.  Similar  behavior  is  in  laboratory 
experiments  with  standing- wave  forcing  {Taylor  1992, 
McEwan  1983a).  Afterwards,  the  region  of  the  small- 
scale  turbulent  structures  spreads,  eventually  ^contam¬ 
inating’  the  entire  isosurface.  If  we  compare  the  final 
frame  with  the  first  frame,  we  see  that  the  final  surface 
is  much  rougher,  fiilled  with  small-scale  structures  ev¬ 
erywhere,  and  that  there  is  no  larger  scale  folding-over 
of  the  surface  as  there  was  in  the  first  frame.  In  the 
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Figure  2.  The  evolution  of  the  p  =  po  isopycnal  during  one  cycle  of  the  forcing.  The  frames  are  ordered  by  time 
from  left  to  right  and  top  to  bottom.  The  first  corresponds  to  t  =  ll.SQTp,  and  the  interval  between  frames  is 
At  =  TF/7. 
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later  evolution,  the  wave  will  brealc  again,  but  only  af¬ 
ter  a  refractory  period,  in  this  case  of  about  two  forcing 
cycles. 

Next  we  will  consider  the  energy  spectra  for  the  flow 
at  the  same  times  as  those  illustrated  in  Figure  2.  Since 
the  energy  is  highly  anisotropic  at  scales  larger  than 
those  in  the  inertial  range,  plotting  the  total  energy  as 
a  function  of  the  isotropic  wavenumber  tends  to  obscure 
the  transition  between  small  and  large  scales.  To  most 
clearly  display  the  transition,  we  have  found  it  useful  to 
consider  the  spectrum  Ey(k)  of  v,  the  y  component  of 
velocity,  which  is  the  horizontal  component  that  is  di¬ 
rectly  affected  by  the  forcing.  Along  with  the  spectra, 
we  have  also  drawn  lines  corresponding  to  the  inertial 
range  spectrum  and  buoyancy  range 

spectrum  For  each  frame,  e  is  taken  as  the 

total  kinetic  energy  dissipation  rate  at  that  time.  We 
have  included  a  factor  of  (1/3)  which  is  appropriate 
for  a  single  component  in  the  isotropic  inertial  range. 
For  the  Kolmogorov  constant,  a  value  of  1.5  was  used 
in  each  case.  We  should  emphasize  that  no  attempt 
is  made  here  to  fit  the  data,  but  the  coefficient  is  just 
taken  as  this  standard  value  a  priori.  For  the  buoyancy 
range  spectrum,  we  have  used  the  coefficient  a  =  0.2 
in  all  cases.  In  each  frame  shown  in  Figure  3,  we  see  a 
fairly  good  match  at  wavenumbers  greater  than  about 
20  (that  is  for  scales  below  about  1  m)  to  the  Kol¬ 
mogorov  inertial  range  spectrum.  The  main  deviation 
is  at  wavenumbers  near  fc  =  60,  the  cutoff  wavenumber, 
and  this  is  to  be  expected  from  previous  experience  with 
the  cusp  model  (cf.  Lesieur  and  Rogallo,  1989).  The 
spectnmi  below  wavenumber  20  is  naturally  far  more 
irregular  than  that  above  due  to  the  much  smaller  num¬ 
ber  of  modes  in  the  lower  spectral  bands.  If  we  neglect 
the  first  few  wavenumbers,  then  there  is  some  evidence 
here  for  a  steeper  spectral  range  for  wavenumbers  below 
about  k  =  20,  that  is  for  scales  larger  than  about  1  m, 
at  least  in  the  frames  that  correspond  to  times  during 
the  breaking  of  the  wave  (first  four  panels).  In  the  af¬ 
termath  of  breaking,  the  spectra  tend  to  be  somewhat 
flatter  (the  last  four  panels).  The  best  representative 
of  the  transition  between  buoyancy  and  inertial  range 
is  found  in  panel  (c),  which  corresponds  to  a  time  when 
the  enstrophy  is  near  a  local  maximum.  Here  the  buoy¬ 
ancy  range  spectrum  makes  a  reasonably  good  fit  in 
the  range  of  scales  from  about  4  m  down  to  about  1  m. 
From  the  forcing  scale  (20  m  vertical)  to  about  the  5  m 
scale,  there  is  a  dip  in  the  energy  that  has  also  been  seen 
in  the  spectra  from  similar  two-dimensional  simulations 
of  the  decay  of  a  standing  wave  {Bouruet-Aubertot  et  ai 
1996). 

In  this  experiment  it  appears  that  the  expected  spec¬ 
tral  signature  of  a  transition  between  a  buoyancy  range 
at  large  scale  and  the  inertial  range  at  small  scale  oc¬ 


curs  only  for  periods  during  which  there  is  active  break¬ 
ing.  Indeed,  it  appears  that  wave-wave  interactions  re¬ 
peatedly  build  up  energy  in  the  buoyancy  range  until 
a  spectrum  is  achieved.  At  that  point  significant 
breaking  occurs  and  energy  drains  from  the  buoyancy 
range.  Let  us  focus  on  the  breaking  event.  In  Fig¬ 
ure  4a  is  the  image  of  the  />  =  po  isosurface  at  the 
time  identified  as  the  best  for  illustrating  the  spectral 
transition  from  the  buoyancy  to  the  inertial  range.  It 
shows  the  curling  over  and  spilling  down  or  plunging  of 
the  heavier  fluid  over  lighter,  while  Figure  4b  suggests 
mixing  by  the  appearance  of  many  small-scale  struc¬ 
tures  along  the  two  parallel  lines  of  the  breaking  wave. 
The  corresponding  spectra  for  all  three  components  of 
kinetic  and  for  the  potential  energy  are  shown  in  Fig¬ 
ure  5.  First  we  notice  that  although  the  spectra  are 
highly  anisotropic  from  the  forcing  scale  (20  m)  down 
to  about  the  1  m  scale,  there  is  an  approximate  ‘re¬ 
turn’  to  isotropy  for  the  smaller  scales.  This  is  particu¬ 
larly  evident  in  the  kinetic  energy  spectra  for  t  =11.82 
Tf  (panel  c).  In  panels  (a)  and  (c),  we  have  made  an 
attempt  to  draw  the  best  fit  inertial  range  spectra  to 
determine  the  appropriate  Kolmogorov  constants  (Ck) 
that  fit  these  data.  We  did  this  for  the  Ey  (k)  spectra, 
obtaining  the  best  fit  ‘by  eye’  from  enlarged  portions 
of  the  small  scale  spectra.  The  result  that  was  used  to 
draw  the  inertial  range  model  spectra  in  panels  (a)  and 
(c)  is  (Ck)  =  1-4.  In  panels  (b)  and  (d),  the  poten¬ 
tial  energy  spectra  are  drawn.  In  these  panels  the  small 
scales  were  fit  to  the  Corrsin-Obukhov  spectrmn  to  de¬ 
termine  the  appropriate  Corrsin  constant.  In  panels  (b) 
and  (d)  the  Corrsin  constants  used  to  draw  the  model 
Corrsin-Obukhov  spectrum  were  Co  =  0.83  and  0.8  re¬ 
spectively.  In  all  panels  the  model  buoyancy  range 
spectrum  drawn  is  0.2N'^k^^.  Thus  the  Kolmogorov 
constant  found  here  is  somewhat  smaller  than  the  em¬ 
pirical  values  of  1.5  and  the  Corrsin  constant  is  some¬ 
what  larger  than  the  empirical  value  of  0.67.  Never¬ 
theless,  the  values  are  remarkably  close  to  the  empiri¬ 
cal  values,  given  that  the  spectral  width  of  the  inertial 
range  here  only  covers  wavelengths  from  about  1  me¬ 
ter  to  about  33  cm.  Also  the  near  collapse  of  the  three 
kinetic  energy  spectra  for  small  scales  is  encouraging. 
Thus  it  seems  that  the  subgrid  scale  model  is  work¬ 
ing  well  at  small  scales  and  that  the  d3mamics  of  the 
transition  from  anisotropic  buoyancy  to  the  isotropic 
inertial  range  is  acting  as  imagined  in  theoretical  mod¬ 
els.  Finally,  we  should  note  that  the  value  of  e  from  the 
simulations  is  about  one  third  of  the  value  observed  by 
Alford  and  Pinkel  (2000)  associated  with  values  of  iV  = 
3  cph.  This  appears  quite  reasonable  given  the  level 
of  modeling  we  have  had  to  employ  for  the  forcing  and 
subgrid  scale  vortices. 

Besides  the  kinetic  and  potential  energy  spectra,  we 


Figure  3.  Ki 
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netic  energy  spectra  for  the  v  component  of  the  velocity  during  one  cycle  of  the  forcing.  The  dashed 
ind  to  the  Kolmogorov  spectrum  (l/3)Cif€2/3^-5/3  Ck  =  1.5  and  the  saturation  spectrum 
le  time  interval  between  frames  is  ATf/7  and  the  first  frame  corresponds  to  time  t  =  11  397^  These 
rts  of  Eik)  in  units  of  N^{L/2irf  vs.  k  in  units  of  2-k/L.  All  plots  have  the  scales  as  indicated  in 
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Figure  4.  A  breaking  event  visualized  on  the  p  —  po  isopycnaJ.  These  are  enlargements  of  the  images  shown  in 
the  composite  Figure  2  in  panels  3  and  4,  corresponding  to  times  (a)  11.68  and  (b)  11.82  Tp  (one  seventh  of  a 
forcing  period  apart). 
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Figure  5.  (a)  Kinetic  energy  spectra  for  all  three  components  of  the  velocity  at  time  t  =  ll.eSTir.  The  thick  Ion®- 
d^hed,  solid  and  short  dashed  Hnes  correspond  to  the  energy  spectra  for  the  u,v,  and  w  components  respectively^ 
The  thin  solid  hn^  correspond  to  the  Kolmogorov  spectrum  (l/3)C;f£2/3A:-5/3  ^th  Ck  =  1.4  and  the  saturation 
spectrum  0.2Ar  k  .  (b)  Potential  energy  spectrum  at  time  t  =  llSSTp.  The  thick  solid  line  corresponds  to  the 
potential  energy  spectrum.  The  thin  solid  hnes  correspond  to  the  Corrsin-Obukhov  spectrum  with 

Co  =  0.83  and  the  saturation  spectrum  0.2N^k-^.  (c)  As  in  (a)  but  for  t=11.82  Tp  and  Ca-  =  1  4  fd)  As  in  (h) 
but  for  t=11.82  Tp  and  Co  =  0.80.  ^  ^  ^ 
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can  also  find  predictions  for  the  buoyancy  flux  spectrum 
in  both  the  theory  of  Lumley-Shur  (cf.  Lumley  1964, 
1967,  Phillips  1967 ^  Weinstock  1985)  and  the  theory 
of  Holloway  (1983,  1986).  The  modal  spectrum  of  the 
buoyancy  flux  can  be  written  as 

-gn  <  >  /po.  (19) 

If  this  quantity  is  positive,  then  for  wavevector  k  there 
is  conversion  of  potential  energy  to  kinetic  energy,  and 
vice  versa  if  it  is  negative. 

The  prediction  of  the  Lumley-Shur  theory  for  the 
buoyancy  flux  spectrum  in  the  buoyancy  and  inertial 
ranges  is 

BFik)  =  -2£»g  (l  +  Dih/k)^^^)  {k„lk)^l^  (20) 

where  is  as  defined  in  (4)  and  £)  is  a  constant.  Lumley 
(1964)  assumed  the  buoyancy  flux  to  be  negative  and, 
hence,  D  to  be  positive.  In  displaying  his  final  result, 
Lumley  incorporated  D  into  his  definition  for  kb,  but 
we  will  leave  it  explicit.  Lumley ’s  prediction  of  negative 
buoyancy  flux  through  the  buoyancy  and  inertial  ranges 
is  just  the  opposite  of  what  we  have  foimd  numerically 
for  our  wave-forced  problem.  All  of  the  ingredients  for 
an  alternative  prediction  of  the  buoyancy  flux  are  given 
in  Holloway  (1983),  and  based  on  this  we  have  derived 
the  same  prediction  as  given  in  (20,  but  with  the  sign 
of  D  clearly  arbitrary  (for  details  see  Camevale  et  oZ., 
2001). 

In  Figure  6a,  we  plot  the  buoyancy  flux  spectrum 
from  our  simulation  as  a  function  of  k.  This  is  a  time 
averaged  spectrum,  where  we  have  averaged  over  a  pe¬ 
riod  of  6Tp,  with  time  increment  of  Q.lTp.  The  time 
averaging  is  necessary  to  remove  temporal  fluctuations 
in  the  large-scales.  Note  that  the  buoyancy  flux  spec¬ 
trum  is  negative  for  large  scales  {I  <  k  <  3),  and  pos¬ 
itive  for  smaller  scales.  This  implies  a  transformation 
of  kinetic  to  potential  energy  at  large  scales  (closest  to 
the  forcing  scale  k  =  y/2)  and  a  transfer  of  potential 
to  kinetic  energy  at  all  smaller  scales.  Since  our  ob¬ 
served  buoyancy  flux  spectrum  is  positive  through  both 
the  buoyancy  and  inertial  ranges,  it  can  be  compared  to 
the  theoretical  prediction  given  by  (20)  only  by  choos¬ 
ing  a  negative  value  for  D,  To  define  the  constant  D, 
we  note  that  the  wavenumber  where  the  buoyancy  flux 
vanishes  is  determined  by  D.  Here  we  shall  choose  D 
so  that  the  zero  value  occurs  at  fc  =  3.5  (corresponding 
in  our  simulation  to  a  wavelength  of  5.7  m)  since  our 
buoyancy  flux  was  found  to  vanish  between  fc  =  3  and 
fc  =  4.  The  theory  will  apply  only  above  this  wavenum¬ 
ber,  and  we  can  think  of  this  as  the  lower  limit  on  the 
buoyancy  range,  or  the  upper  wavenumber  of  the  Gar¬ 
rett  Munk  spectnun  in  the  schematic  shown  in  our  in¬ 
troduction.  To  compute  kb,  given  by  (4),  we  use  the 
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Figure  6.  (a)  Graph  of  the  buoyancy  flux  spectrum 
averaged  over  6  periods  of  the  forcing,  with  10  samples 
per  period,  (b)  as  in  (a)  but  only  the  positive  portion  of 
the  spectrum  plotted  in  log-log  format  to  compare  with 
the  theoretical  spectrum  of  equation  (20)  with  negative 
coefficient  D,  The  result  from  the  simulation  is  repre¬ 
sented  by  the  thick  line,  while  the  theoretical  spectrum, 
based  on  eo  =  e  where  the  overbar  represents  time  av¬ 
eraging,  is  drawn  as  a  thin  line.  All  graphs  in  (a)  and 
(b)  are  normalized  by  e/K^. 
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time  averaged  dissipation  rate  i.  Thus  all  parameters 
in  the  theory  are  determined  by  the  wavenumber  where 
the  buoyancy  flux  vanishes  and  the  values  of  N  and  e 
(in  this  case  h  =  34.1),  The  resulting  the> 

oretical  buoyancy  flux  spectrum  is  compared  in  Figure 
6b  to  the  results  from  our  simulation.  For  wavenum¬ 
bers  in  the  buoyancy  range,  the  match  between  theory 
and  simulations  is  reasonably  good.  For  the  theoreti¬ 
cal  curve,  the  decay  with  k  is  approximately  for 
all  k  above  about  10.  The  simulation  data  follow  the 
theoretical  curve  fairly  well  up  to  about  wavenumber 
20,  where  the  simulation  spectrum  begins  to  deviate 
from  and  is  clearly  much  shallower  than  this  for 

fc  >  30.  This  shallowness  of  the  simulation  spectrum 
for  k  greater  than  about  30  is  probably  an  indication 
that  the  buoyancy  flux  is  not  captured  properly  by  the 
SGS  model  near  the  high  wavenumber  cutoff.  The  cusp 
model  viscosity  grows  rapidly  with  k  for  wavenumbers 
above  about  A:  =  30  and  is  largest  at  A; max-  This  is 
just  the  range  where  our  buoyancy  flux  spectrum  be¬ 
comes  very  shallow.  It  is  very  possible  that  the  artificial 
damping  of  the  high  k  modes  that  the  model  performs 
to  mimic  transfer  of  energy  beyond  A;max  does  not  allow 
for  the  proper  treatment  of  the  buoyancy  flux  in  that 
region.  But  this  is  not  unexpected  for  such  a  subgrid- 
scale  model. 

Positive  buoyancy  flux  for  small  scales  has  also  been 
found  in  other  simulations.  In  direct  numerical  simula¬ 
tions  (i.e.  simulations  without  subgrid  scale  modeling) 
of  forced  stratified  turbulence  in  both  two  and  three- 
dimensions,  Holloway  (1988),  and  Ramsden  and  Hol¬ 
loway  (1992)  showed  that  the  buoyancy  flux  was  nega¬ 
tive  only  at  large  scales  and  positive  at  small  scales. 
These  results  were  interpreted  as  meaning  negative 
buoyancy  flux  for  k  <  kb  (i.e.  in  the  buoyancy  range) 
and  positive  buoyancy  flux  for  higher  k.  However,  the 
forcing  used  in  their  simulations  was  spectrally  fairly 
broad,  and  it  would  not  be  inconsistent  with  their  re¬ 
sults  to  say  that  the  buoyancy  flux  was  negative  at  the 
strongly  forced  modes  and  positive  for  smaller  scales  as 
in  our  findings.  Additionally,  we  have  repeated  our  nu¬ 
merical  simulations  with  a  finite  difference  code  using 
a  Smagorinsky  eddy  viscosity,  a  very  independent  test, 
and  also  fotmd  positive  buoyancy  flux  through  the  buoy¬ 
ancy  and  inertial  ranges.  In  their  finite  difference  LES 
study  of  shear  driven  stratified  turbulence,  Kaltenbachet 
et  al  (1994)  also  found  positive  buoyancy  flux  at  small 
scales  and  negative  at  large  scales,  although  we  must 
note  that  the  region  of  negative  buoyancy  flux  in  their 
simulations  is  spectrally  very  broad  compared  to  ours. 
In  two-dimensional  flow  simulations  of  the  decay  of  a 
standing  wave  of  just  the  type  that  we  use  for  forcing 
our  flow,  Boumet-Aubertot  et  al.  (1996)  found  that  the 
buoyancy  flux  was  positive  through  most  of  the  range 


that  they  identified  with  the  buoyancy  range,  and  also 
that  the  flux  followed  a  spectral  law  in  a  run  with 
grid  resolution  256^  and  a  slightly  steeper  law  at  reso¬ 
lution  512^  (note  that  those  simulations  did  not  include 
an  inertial  range). 

There  was  some  discussion  at  the  ’Aha  Huliko’a 
meeting  about  the  possibility  that  the  sign  of  the  buoy¬ 
ancy  flux  found  in  these  simulations  is  affected  by  the 
type  of  forcing  used.  Our  forcing  inputs  both  potential 
and  kinetic  energy,  while  it  may  be  more  suitable  to  con¬ 
sider  a  source  of  kinetic  energy  alone  in  these  problems. 
If  there  were  no  explicit  external  forcing  of  the  density 
evolution,  then  the  net  buoyancy  flux  (averaged  over 
time)  would  have  to  be  negative  to  balance  the  drain 
of  potential  energy  due  to  diffusion.  This  however  does 
not  mean  that  the  buoyancy  flux  would  have  to  be  nega¬ 
tive  for  all  scales.  Further  simulations  would  be  helpful 
to  define  how  the  buoyancy  flux  spectrum  varies  as  the 
mix  of  kinetic  and  potential  energy  sources  are  changes. 

Structures  in  regions  of  high  strain  rate 

The  main  structure  of  interest  in  the  buoyancy  range 
evident  in  the  density  isosurfaces  presented  in  the  last 
section  is  the  overturn  produced  by  the  curling  over  of 
the  isosurface  in  a  manner  familiar  from  surface  wave 
breaking.  The  overturning  region  shown  in  the  breaking 
wave  illustrated  in  Figure  2c  has  a  vertical  scale  of  about 
2  meters.  This  is  similar  in  size  to  overturns  found 
in  oceanographic  measurements  in  the  buoyancy  range. 
Alford  and  Pinkel  (2000)  made  an  inventory  of  more 
than  2200  overturns.  They  found  a  median  Thorpe 
scale,  a  measure  of  the  vertical  extent  of  the  overturn,  of 
1.88  m.  Note  that  this  is  not  greatly  different  from  the 
scale  suggested  by  the  transition  point  in  the  spectra 
shown  in  Figure  5,  where  the  associated  length  scale  is 
about  1.2  m.  Since  the  observational  data  are  primarily 
one-dimensional  in  space,  it  is  difficult  to  form  a  three- 
dimensional  image  of  those  overturns.  The  ability  to 
perform  three-dimensional  analysis  of  such  structures 
is  one  of  the  benefits  of  numerical  simulation. 

Examining  the  full  density  field  more  thoroughly,  we 
also  find  interesting  structures  of  a  rather  different  na¬ 
ture  than  those  associated  with  strong  vertical  shear. 
These  can  be  represented  well  by  the  deformations  of 
the  density  surfaces  that  are  the  flat  nodal  surfaces  of 
the  forcing  wave.  We  shall  just  refer  to  these  surfaces  as 
the  nodal  surfaces  even  when  perturbed  and  deformed 
by  eddies.  The  most  basic  motion  of  the  fluid  in  the 
nodal  surfaces  is  alternately  toward  and  away  from  the 
centers  of  high  strain;  however,  the  combination  of  the 
large-scale  background  straining  motion  and  small-scale 
eddies  produces  localized  deformations  of  the  nodal  sur¬ 
face  that  can  result  in  overturning  and  mixing  in  a  man- 
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Figure  7.  The  evolution  of  three  density  isosurfaces  (p»(p  -  po)/po  =  0, 7r/2,  tt)  showing  the  evolution  of  ‘spouts’ 
from  a  nodal  surface  and  their  subsequent  coUapse  with  considerable  broadening  and  mixing.  Times  represented 
are  t  =  12.1,  12.4,  12.5,  12.7,  12.8,  and  13.1  Tp. 
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ner  different  from  the  overturns  discussed  in  the  last 
section.  By  plotting  simultaneously  three  density  iso¬ 
surfaces  (one  ‘nodal  surface’  and  the  most  strongly  per¬ 
turbed  isosurfaces  above  and  below  it)  for  a  sequence  of 
times  during  the  forcing  cycle,  we  can  get  some  under¬ 
standing  of  the  nature,  formation,  evolution,  and  fate 
of  these  structures.  In  Figure  7,  the  sequence  proceeds 
from  left  to  right,  top  to  bottom.  We  have  shifted  our 
view  of  the  computational  domain  by  an  amount  in  the 
vertical  sufficient  to  center  the  upper  ‘nodal  surface’  in 
the  image.  Above  and  below  the  ‘nodal  surface,’  the 
most  strongly  displaced  isosurfaces  are  shown.  The  up¬ 
per  and  lower  isosurfaces  move  vertically  but  always 
in  opposite  directions  at  any  horizontal  location.  The 
combined  effect  of  the  motion  of  these  surfaces  above 
and  below  the  ‘nodal  surface’  produces  vertical  ‘dila¬ 
tion’  and  ’compression’  centered  on  the  ‘nodal  surface’ 
without  producing  large-scale  sinusoidal  displacement 
of  that  surface.  In  panels  (a),  (b)  and  (c),  the  up¬ 
per/lower  surface  is  moving  upward/downward  in  the 
middle  of  the  domain  (i.e.  &ty  =  0,  where  y  is  the  hor¬ 
izontal  coordinate),  and  oppositely  at  the  left  and  right 
ends  of  the  domain.  This  is  associated  with  the  verti¬ 
cal  straining  of  the  nodal  surface  in  the  middle  and  at 
the  left  and  right  ends  of  the  domain.  In  panel  (b)  the 
isosurface  ‘erupts’  with  elements  moving  up  and  down 
along  a  midline  pointing  into  the  plane.  The  eruption 
reaches  its  maximum  extension  when  the  upper  and 
lower  surfaces  stop  their  motion,  and  reverses  direction 
around  the  time  of  panel  (c).  At  y  =  0  on  the  ‘nodal 
surface’  this  is  a  time  of  maximum  vertical  dilational 
strain  but  zero  strain  rate  (where  dwfdz  is  the  vertical 
strain  rate).  The  structures  formed  by  these  eruptions 
represent  localized  intrusions  of  heavy  fluid  into  light 
fluid  and  vice  versa.  We  shall  refer  to  them  as  ‘spouts.’ 
As  the  upper  and  lower  isosurfaces  move  back  toward 
the  ‘nodal  simface,’  the  sense  of  straining  motion  is  re¬ 
versed  and  the  spouts  that  were  formed  are  flattened. 
This  causes  a  spreading  out  of  these  structures,  which 
in  some  cases  results  in  tossing  elements  of  the  spouts 
to  the  right  and  left  of  the  midline.  This  leads  to  the 
kind  of  pattern  seen  in  panel  (e)  which  is  in  part  an 
elongated  horizontal  structure  as  opposed  to  the  elon¬ 
gated  vertical  structures  originally  produced  during  the 
vertically  dilational  phase  of  the  large-scale  straining 
motion.  The  flnal  panel  (f)  shows  the  isosurface  a  short 
time  after  the  upper  and  lower  surfaces  have  again  re¬ 
versed  their  direction  of  vertical  motion.  This  is  a  phase 
of  the  motion  near  to  that  of  the  initial  panel  (a),  but 
now  there  is  a  mixed  patch  of  fluid  at  the  mid  section 
(y=0)  of  the  ‘nodal  surface.’ 

To  summarize,  we  can  say  that  the  ‘spouts’  origi¬ 
nate  from  small-scale  deviations  of  the  nodal  surface 
created  by  turbulent  flow  at  the  nodal  surface.  Once 


perturbations  pull  structures  from  the  nodal  planes  ver¬ 
tically,  these  elements  are  subject  to  advection  due  to 
the  large-scale  straining  motion  of  the  forcing  wave.  At 
times  and  positions  where  the  straining  is  highly  di¬ 
lational  in  the  vertical,  these  deviations  from  the  flat 
plane  elongate  vertically  and  narrow  horizontally,  form¬ 
ing  ‘spouts.’  Then,  during  the  vertically  compressional 
and  horizontally  dilational  phase  of  the  forcing,  the 
spout  is  elongated  horizontally  creating  regions  of  con- 
vectively  unstable  overturned  fluid.  Note  that  if  the 
large-scale  forcing  were  the  only  field  acting  on  the 
spout,  than  the  growth  of  the  spout  would  simply  have 
been  reversed  when  the  sense  of  the  straining  motion 
was  reversed.  Thus  the  presence  of  the  eddy  field  must 
play  an  important  role  in  this  irreversible  process.  The 
distortions  of  the  spout  by  the  eddy  field  are  enhanced 
during  the  horizontally  dilational  phase  of  the  evolu¬ 
tion. 

Internal-wave  packets 

The  observations  of  Alford  and  Pinkel  (2000)  show 
vertically  propagating  structures  at  depths  from  150  to 
350  m  which  they  suggest  may  be  internal  wave  pack¬ 
ets.  These  structures  have  vertical  extent  of  about  50  m 
with  internal  vertical  wavelengths  of  about  12  m  and  are 
associated  with  overtinning  events  with  vertical  scales 
of  about  2  m.  Recent  theoretical  analysis  by  Thorpe 
(1999)  provides  a  criterion  for  determining  whether  the 
small-scale  turbulence  generated  by  the  overturns  in  a 
packet  will  be  left  behind  in  just  small  patches  or  in  con¬ 
tinuous  ‘scars’  much  longer  than  the  size  of  the  packet. 
Stimulated  by  these  developments,  we  have  embarked 
on  a  mnnerical  investigation  of  internal  wave  packets. 

Assuming  a  constant  background  Brunt-Vaisala  fre¬ 
quency  TV  and  ignoring  the  effects  of  the  earth’s  ro¬ 
tation,  the  intrinsic  dimensional  frequency  for  internal 
waves  is 

<7  =  TV^.  (21) 

The  observed  frequency  for  one  of  the  wavepackets  in 
the  Alford  and  Pinkel  (2000)  data  is  4  cph.  This  is 
higher  than  the  ambient  TV  w  3  cph.  Since  a  max  =  TV,  it 
is  assumed  that  the  observed  frequency  for  this  packet 
is  the  sum  of  the  intrinsic  frequency  plus  a  Doppler 
shift.  To  predict  this  shift,  it  is  necessary  to  know  the 
wavelength  of  the  packet,  the  magnitude  of  the  ambient 
current  and  its  direction  relative  to  the  packet  propa¬ 
gation  direction.  Alford  and  Pinkel  (2000)  suggest  that 
the  intrinsic  frequency  for  their  packet  with  observed 
frequency  of  4  cph  is  near  0.14  cph  which  leads  one  to  a 
wavelength  of  180  m.  This  suggests  that  the  horizontal . 
wavelengths  in  both  directions  are  much  larger  than  the 
vertical  wavelength.  For  our  numerical  modeling,  this 
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represents  a  difficulty.  We  are  reluctant  to  introduce 
anisotropic  grids  for  fear  of  the  distortions  that  might 
result,  especially  when  applying  simple  sub-grid  scale 
models.  Thus,  in  this  preliminary  work,  we  decided  to 
consider  only  the  case  in  which  horizontal  and  vertical 
wavelengths  were  equal.  The  corresponding  intrinsic 
frequency  would  then  be  about  2  cph  which  would  still 
be  consistent  with  the  observed  packet,  just  requiring 
less  of  a  Doppler  shift  to  match  the  observed  frequency. 

As  for  the  amplitude  of  the  observed  packets,  this 
can  be  given  in  terms  of  the  peak  magnitude  of  the  ob¬ 
served  strain  rate  dwfdz.  The  maximum  value  of  verti¬ 
cal  strain  rate  in  the  Alford  and  Pinkel  (2000)  observa¬ 
tions  is  approximately  iV,  and  in  the  case  of  the  particu¬ 
lar  packet  discussed  above,  it  seems  that  the  maximum 
is  about  0.38i\r. 

In  what  follows,  we  will  examine  the  evolution  of 
a  particular  wave  packet  with  two-dimensional  simula¬ 
tions.  In  an  attempt  to  reproduce  the  kind  of  behavior 
evident  in  the  observations,  we  used  simulations  in  a 
domain  of  200  m  in  both  width  and  depth.  We  used 
a  packet  with  non-dimensional  wavenumbers  of  12  in 
both  directions,  corresponding  to  vertical  and  horizon¬ 
tal  wavelengths  of  (200  m)/12  «  17  m.  Our  2D  simula¬ 
tions  had  an  effective  resolution  corresponding  to  a  cut¬ 
off  wavelength  of  «  0.8  m.  To  follow  this  phenomenon 
in  DNS  with  all  relevant  scales  well  resolved  would  re¬ 
quire  resolution  from  200  m  down  to  a  few  cm,  which 
is  somewhat  impractical.  Since  the  subgrid  scale  model 
used  in  the  3D  simulations  is  not  appropriate  in  2D, 
we  had  recourse  to  hyperviscosity  (with  the  Laplacian 
taken  to  the  eighth  power).  The  simulations  illustrated 
here  are  from  a  spectral  code  dealiased  with  the  3/2 
rule  {Orszag  1971).  Although  there  are  768  wavevec- 
tors  used  in  each  direction,  after  application  of  the  3/2 
nile  this  leaves  only  512  active  modes  in  each  direction. 

Linear  dispersion  of  packets 

The  linearized  version  of  the  Boussinesq  evolution 
equations  can  be  used  to  obtain  a  model  of  the  inter¬ 
nal  wave  packet.  The  vorticity  and  density  of  a  plane 
internal  wave  can  be  written  dimensionally  as 

=  Aekexpi(k  •  r  -  at),  (22) 

where  A  is  an  arbitrary  amplitude  and  e  is  the  eigen¬ 
vector 

Ck  =  {gkky/Nkh,  -gkkx/Nkh,  0,  po).  (23) 

Taking  a  linear  superposition  of  such  waves  distributed 
continuously  in  wavevector  space  and  centered  on  a  par¬ 
ticular  wavevector,  say  ko,  would  produce  an  internal 


wave  packet.  For  example, 

(w,  p')  =  T^ej  G(k  -  ko)eke*(‘'  '""*)£i3A:,  (24) 

with 

G(p)  =  A  exp  {-a^pl  -  b'^pl  -  c^pl) ,  (25) 

where  a,  6,  and  c  are  length  scales,  represents  a  propa¬ 
gating  ellipsoidal  packet.  A  slight  generalization  based 
on  simple  coordinate  rotations  will  also  permit  an  arbi¬ 
trary  choice  for  the  orientation  of  the  ellipsoidal  enve¬ 
lope  relative  to  the  crests  internal  to  the  packet.  Within 
the  envelope,  the  vorticity  and  density  fields  will  have  a 
phase  velocity  in  the  direction  of  ko  and  group  velocity 

Cg  =  Vk^k,  (26) 

which  is  perpendicular  to  the  phase  velocity. 

By  varying  the  dimensions  a,  b  and  c,  we  can  change 
the  shape  of  the  packet  as  needed.  A  likely  candidate 
for  the  packets  whose  effects  are  observed  in  Alford  and 
PinkeVs  (2000)  data  would  suggest  that  at  least  one  of 
these  lengthscales  is  very  large.  For  the  present  calcu¬ 
lations  we  take  a  to  be  infinite.  Then  we  chose  b  and 
c  and  the  orientation  of  the  system  to  be  such  that  the 
envelope  is  an  ellipse  with  major  axis  aligned  along  the 
direction  of  propagation.  Other  choices  may  also  be  of 
interest,  but  that  will  be  explored  in  future  work.  With 
the  ellipse  as  chosen,  the  phase  velocity  is  directed  along 
the  short  axis  and  the  group  velocity  along  the  long 
axis.  In  a  numerical  simulation,  the  packet  can  only 
be  approximated,  with  the  integral  replaced  by  a  dis¬ 
crete  sum  of  wavevectors.  By  using  (24)  and  (25)  with 
t  =  0,  we  are  able  to  construct  the  initial  condition  for 
a  packet  that  is  both  reasonably  confined  in  space  and 
well  resolved  internally. 

The  first  issue  that  we  need  to  address  is  the  disper¬ 
sive  spreading  of  the  wave  packet.  Simple  arguments 
suggest  that  the  physical  extent  of  the  wave  packet  will 
grow  as  Ac^t  in  the  direction  of  the  group  velocity, 
where  Acp  represents  the  spread  in  group  velocities  cal¬ 
culated  for  the  individual  wavevectors  that  contribute 
significantly  to  the  wave  packet.  We  can  make  some 
crude  dimensional  estimates  for  the  rate  of  dispersion 
by  setting  Cg  ~  N/ko  and  Ac^  ~  (Ar/A;o)Afco,  where 
Afco  measures  the  spread  of  wavenumbers  in  the  packet. 
If  we  call  Axo,  the  initial  length  of  the  wavepacket,  then 
the  change  in  the  size  of  the  packet  can  be  crudely  taken 
as 

Ax  -  Axo  =  Acgt.  (27) 

The  packet  would  then  double  in  size  by  a  time  td  ^ 
Axo/Ac^,  and  the  distance  that  the  packet  can  travel 
before  doubling  is 


x/Axo  fco/Afco- 


(28) 
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Figure  8.  Contours  of  the  magnitude  of  the  pertur¬ 
bation  density  Ip'/Pol  ^om  a  simulation  of  the  linear 
propagation  of  a  wavepacket.  The  domain  size  is  200 
m  on  each  side.  The  vertical  axis  is  depth.  The  only 
contour  level  drawn  is  that  at  0.5  of  the  maximum  field 
value.  The  time  sequence  of  the  panels  is  (a)  t— 0,  (b) 
t=3  hr,  (c)  t=6  hr,  and  (d)  11  hr. 


For  the  packet  used  in  the  simulations  this  predicts  a 
doubling  after  propagation  of  about  200  m. 

In  Figure  8,  we  show  the  evolution  of  the  density 
perturbation  field  during  the  propagation  of  our  packet 
following  purely  linear  dynamics.  In  each  panel,  only 
the  contour  level  corresponding  to  0.51pVpoI  is  drawn. 
Positive  and  negative  values  have  not  been  indicated, 
but  clearly  the  sign  of  p'  will  alternate  from  one  wave 
crest  to  the  next.  We  see  the  packet  propagates  along 
the  diagonal.  This  is  in  agreement  with  the  fact  that  the 
wavevector  is  k  =  (12, 12)  and  that  the  group  velocity 
is  perpendicular  to  this.  It  is  less  obvious  from  the 
few  panels  that  we  can  include  here  that  the  phase  of 
the  waves  within  the  packet  advances  in  the  direction 
of  k.  The  average  speed  of  the  packet  in  propagating 
from  one  corner  of  the  domain  to  the  opposite  comer 
is  correctly  given  by  \cg\.  Furthermore,  we  see  that  the 
width  and  length  of  the  packet  grow  to  a  little  more 
than  double  their  original  values  in  the  time  it  takes  to 
cross  from  one  corner  of  the  domain  to  the  other,  and 
this  is  correctly  predicted  by  the  formula  (28).  During 
the  period  of  evolution  illustrated,  the  peak  amplitude 
of  the  packet  decays  to  25%  of  its  initial  value. 

Although  the  amplitude  of  the  packet  can  be  changed 


Figure  9.  Contours  of  p/po  from  a  simulation  of  the 
linear  propagation  of  a  wavepacket.  The  domain  size  is 
200  m  on  each  side.  The  vertical  axis  is  depth.  The  time 
sequence  of  the  panels  is  the  same  as  in  Figure  8.  The 
contour  increment  is  such  that  the  vertical  separation 
between  unperturbed  isopycnals  is  8  m. 

arbitrarily  in  this  purely  linear  simulation,  we  may  sim¬ 
ply  assign  an  amplitude  to  see  the  effect  of  such  a  packet 
on  the  full  density  field.  This  is  done  in  Figure  9.  The 
amplitude  used  represents  fluctuations  in  dwfdz  about 
five  times  the  maximum  actually  observed  in  the  Alford 
and  Pinkel  (2000)  data.  Nevertheless,  we  have  used  this 
packet  with  exaggerated  amplitude  to  more  clearly  il¬ 
lustrate  the  nature  of  the  linear  propagation.  In  such 
a  strong  packet,  there  are  regions  of  strong  overturn¬ 
ing,  which,  if  the  packet  is  not  propagating  too  rapidly, 
would  develop  convective  instability  under  the  full  non¬ 
linear  dynamics. 

Nonlinear  propagation  of  packets 

Having  determined  that  our  packet  propagates  cor¬ 
rectly  under  linear  d5mamics,  we  then  investigated  its 
evolution  with  the  complete  Boussinesq  equations.  The 
amplitude  of  the  observed  packet  discussed  in  the  intro¬ 
duction  is  such  that  the  maximum  value  of  the  strain 
rate  dwfdz  is  about  0.38iV.  With  the  packet  amplitude 
set  to  match  this  value  as  its  maximum  dwfdz^  we  per¬ 
formed  the  simulation  illustrated  by  contour  plots  of 
p'/p  in  Figure  10.  This  figure  should  be  compared  to 


Figiire  10.  Contours  of  the  magnitude  of  the  pertur¬ 
bation  density  |pVpo|  from  a  simulation  of  the  nonlinear 
propagation  of  a  wavepacket  with  max  dw/dz  «  0.38A^. 
The  domain  size  is  200  m  on  each  side.  The  vertical  axis 
is  depth.  The  only  contour  level  drawn  is  that  at  0.5 
of  the  maximum  field  value.  The  time  sequence  of  the 
panels  is  the  same  as  in  Figure  8. 


the  corresponding  figure  for  linear  evolution,  Figure  8. 
The  times  represented  are  the  same  in  each  figure.  By 
the  time  of  panel  (b)  a  clear  asymmetry  in  the  form 
of  the  packet  has  developed  in  the  nonlinear  case  and 
there  is  some  clear  distortion  of  the  packet  in  the  fi¬ 
nal  panel.  Nevertheless,  the  overall  evolution  of  this 
nonlinear  packet  is  not  very  different  from  the  linear 
case.  This  packet  is  so  weak  that  the  initial  condition 
is  not  overturning  anywhere  and  the  Richardson  num¬ 
ber  is  above  1  everywhere.  Thus,  the  classical  criteria 
for  convective  instability  and  shear  instability  are  not 
satisfied  in  this  packet.  This  continues  to  be  the  case 
throughout  the  simulation  in  spite  of  small-scale  gen¬ 
eration  by  nonlinear  wave-wave  interactions.  An  idea 
of  how  weak  this  packet  is  can  be  obtained  graphically 
from  the  plots  of  the  density  contours  as  illustrated  in 
Figure  11. 

The  next  case  that  we  will  treat  is  one  for  which  the 
amplitude  of  the  packet  is  just  above  the  threshold  for 
overturning.  The  amplitude  of  this  packet  in  terms  of  its 
maximum  strain  rate  is  dw/dz  =  O.TGAT.  In  Figure  12, 


Figure  11.  Contours  of  p/po  from  the  same  simulation 
as  represented  in  Figure  10.  The  domain  size  is  200 
m  on  each  side.  The  vertical  axis  is  depth.  The  two 
times  illustrated  correspond  to  the  first  and  last  times 
of  Figure  10.  The  contour  increment  is  such  that  the 
vertical  separation  between  unperturbed  isopycnals  is  8 
m. 


we  display  the  contour  plots  for  the  perturbation  den¬ 
sity  at  the  same  times  as  in  the  previous  figures.  We  see 
that  there  is  some  early  production  of  small  scales  that 
are  evident  in  the  wake  of  the  packet.  By  t  =  6  hr  the 
packet  itself  has  become  badly  distorted,  and  by  t  =  11 
hr,  it  has  degenerated  into  small-scale  structures,  al¬ 
though  these  still  retain  to  some  extent  an  organization 
and  alignment  related  to  the  original  structure  of  the 
packet.  To  better  illustrate  the  decay  of  this  packet, 
we  display  contour  plots  of  the  full  density  field  from 
t  =  2.5  hr  to  t  =  4.8  hr  in  Figure  13.  Each  frame  is  an 
enlarged  image  centered  on  the  wave  packet,  showing 
only  a  portion  of  the  domain  (a  square  of  size  200/3 
m  on  a  side).  In  panel  (a)  we  see  an  early  stage  in 
which  the  wave  is  overturning  at  points,  but  there  has 
not  yet  been  any  strong  production  of  energy  in  scales 
smaller  than  2  m  (note  that  the  spacing  between  the 
imperturbed  isopycnals  is  2  m).  There  are  four  rela¬ 
tively  strong  crests  evident  in  panel  (a).  These  crests 
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Figure  12.  Contours  of  the  magnitude  of  the  pertur¬ 
bation  density  \p'/po\  from  a  simulation  of  the  nonlinear 
propagation  of  a  wavepacket  with  max  dwfdz  ^  0.76N. 
The  domain  size  is  200  m  on  each  side.  The  vertical  axis 
is  depth.  The  only  contour  level  drawn  is  that  at  0.5 
of  the  maximum  field  value.  The  time  sequence  of  the 
panels  is  the  same  as  in  Figure  8. 

are  advancing  from  bottom-left  to  top-right  in  these  fig¬ 
ures.  The  weakest  crest  (bottom-left)  is  just  entering 
the  packet  in  panel  (a).  In  the  linear  evolution  as  each 
crest  passes  through  the  packet  from  bottom-left  to  top- 
right,  its  amplitude  first  increases  and  then  decreases. 
As  envisioned  by  Thorpe  (1999),  the  crests  amplify  as 
they  move  toward  the  center  of  the  packet  and  break 
leaving  small-scale  perturbations  behind  that  link  up 
with  the  ‘debris’  produced  by  the  passage  of  previous 
crests.  The  period  of  the  sequence  of  panels  shown  here 
is  long  enough  for  the  weak  crest  on  the  lower-left  side  of 
the  packet  in  panel  (a)  to  move  completely  through  the 
packet,  finally  becoming  the  weak  crest  on  the  upper- 
right  side.  In  the  case  we  have  simulated  here,  the  crests 
do  produce  overlapping  zones  of  small-scale  perturba¬ 
tions  that  form  a  somewhat  continuous  scar,  a  possiblil- 
ity  suggested  by  Thorpe  (1999).  One  should  note,  how¬ 
ever,  that  during  the  period  when  a  particular  crest  is 
actually  breaking,  the  overturning  and  small-scale  pro¬ 
duction  is  not  uniform  along  the  length  of  the  crest, 
as  assumed  in  Thorpe’s  idealized  model,  but  rather  ap¬ 
pears  in  spots  along  the  crest  (see  panels  (c)  and  (d)). 
Also  the  breaking  and  subsequent  scar  formation  does 


not  continue  indefinitely.  The  strength  of  the  packet  is 
both  dispersed  and  dissipated,  so  that  by  t  =  210N~^ 
the  process  of  scar  formation  has  ceased. 

We  have  also  performed  3D  simulations  of  the  prop¬ 
agation  of  these  wave  packets.  The  general  evolution 
exhibited  in  the  2D  simulations  is  also  found  in  3D, 
although  in  3D  we  did  not  have  sufficient  resolution  ad¬ 
equately  capture  the  2  m  overturns.  Further  details  can 
be  foimd  in  Camevale  and  Orlandi  (2000). 

Theory  of  packet  evolution 

We  have  addressed  here  questions  about  the  longevity 
of  wave  groups,  A  theory  for  the  evolution  of  the  packet 
envelope  has  been  developed  by  Shrira  (1981)  via  mul¬ 
tiscale  analysis  in  both  space  and  time.  His  result  for 
the  evolution  of  the  amplitude  A  of  the  packet  in  the 
two-dimensional  case  studied  above  is 

iAr  +  {^kykyAyy  +  2akykxAyz  +  (^k^k^Azz) 

^  '^{.^kykykyAyyy  -f-  ^CkykykxAyyz  k  z  k  z  Ay  z  z 

‘^o'k^kzkzAzzz}  =  —i'yA{AA*  —  A*  As),  (29) 

where  ky  and  kz  are  components  of  the  central  wavector 
of  the  packet,  a  is  the  intrinsic  frequency  corresponding 
to  the  central  wavevector,  s  is  the  coordinate  in  the 
direction  of  propagation  of  the  packet,  d/dr  =  + 

Cgd/ds,  Cg  is  the  magnitude  of  the  group  velocity,  and 

7  =  P/{akykz).  (30) 

The  typical  equation  that  arises  for  the  evolution  of  a 
wave-packet  envelope  is  the  cubic  Schroedinger  equa¬ 
tion  which  is  significantly  different  from  (29),  It  turns 
out  that  the  term  corresponding  to  the  nonlinearity  in 
the  cubic  Schroedinger  vanishes  identically  here  due  to 
the  fact  that  a  plane  wave  cannot  interact  with  itself. 
Thus  Shrira  had  to  go  to  third  order  in  the  multiple 
scale  analysis  to  obtain  the  first  contributions  of  the 
nonlinearity  to  the  evolution.  This  still  involves  a  cubic 
term  for  the  nonlinearity,  but  now  not  the  simple  A|A|^ 
of  the  cubic  Schroedinger  equation,  and  the  presence  of 
the  third  order  spatial  derivatives  from  the  linear  terms 
further  complicates  matters.  Notice  that  the  coefficients 
depend  in  magnitude  and  sign  on  the  orientation  of  the 
central  wavevector  of  the  packet.  Thus  we  can  antici¬ 
pate  interesting  results  as  this  wavevector  is  varied.  In 
addition,  there  are  nonlocal  nonlinear  terms  that  arise  if 
the  flow  is  three  dimensional  that  further  greatly  com¬ 
plicate  the  evolution.  In  future  work,  we  plan  to  in¬ 
vestigate  the  evolution  of  the  packet  analytically  based 
on  Shrira’s  equations,  and  make  a  comparison  with  our 
numerical  results. 


INTERNAL  WAVE  BREAKING 


171 


Figure  13.  Contours  of  p/po  from  a  simulation  of  the  nonlinear  propagation  of  a  wavepacket  with  initially  max 
dwfdz  «  0.76iV.  Only  a  portion  of  the  computational  frame  is  shown,  and  this  corresponds  to  a  square  200/3  m 


on  each  side.  The  contour  increment  is  such  that  the  vertical  separation  between  imperturbed  isopycnals  is  2  m. 


The  times  corresponding  to  the  panels  are  (a)  2.5,  (b)  2.9  (c)  3.5,  (d)  3.9,  (e)  4.4  and  (f)  4.8  hr. 
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Conclusions 

We  have  attempted  with  some  very  idealized  numer¬ 
ical  modelling  to  shed  some  light  on  the  structures  that 
may  be  responsible  for  the  variability  observed  in  the 
oceanic  thermocline.  This  attempt  has  been  successful 
to  a  certain  extent.  We  were  able  to  show  a  transition 
from  buoyancy  to  inertial  range  that  follows  the  fea¬ 
tures  of  the  observed  spectra,  but  only  during  a  break¬ 
ing  event.  To  what  extent  this  simulated  transition  is 
related  to  the  actual  transition  in  the  oceanic  obser¬ 
vations  is  still  not  clear.  We  were  able  to  give  three- 
dimensional  structure  to  a  classical  shear  type  instabil¬ 
ity  much  as  foimd  in  the  observations  and  at  about  the 
correct  vertical  scale.  In  addition,  we  described  an  un¬ 
expected  kind  of  overturning  and  mixing  that  occurs  in 
regions  of  high  strain  rate.  The  correlation  for  overturns 
and  high  strain  rate  was  suggested  by  Alford  and  Pinkel 
(2000),  and  yet  it  is  not  clear  at  this  point  whether  the 
structures  we  were  able  to  simulate  are  directly  related 
to  any  of  the  overturns  reported  in  that  paper.  Fi¬ 
nally,  two-dimensional  simulations  demonstrated  that 
a  packet  with  approximately  the  correct  vertical  struc¬ 
ture  can  propagate  through  a  substantial  portion  of  a 
thermocline  as  observed  by  Alford  and  Pinkel  (2000) 
without  dispersing  radically  and  with  the  production  of 
overturns  on  the  scale  of  about  2  m  which  is  entirely 
consistent  with  Alford  and  PinkeFs  observations.  Un¬ 
fortunately,  these  simulations  could  not  also  capture  the 
large  horizontal  scales  nor  the  rapid  advection  of  pack¬ 
ets  that  is  suggested  by  Alford  and  Pinkel  (2000).  Thus, 
we  feel  that  there  has  been  some  progress  in  demon¬ 
strating  the  possibility  of  analyzing  the  small-scale  fluc¬ 
tuations  in  the  thermocline  with  numerical  simulations 
using  subgrid  scale  parameterizations,  and  look  forward 
to  advancing  this  work  so  that  closer  comparisons  with 
observations  may  be  possible. 
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Abstract.  In  the  stratified  ocean,  vertical  motions  arise  from  both  adiabatic 
and  diabatic  mechanisms.  Diapycnal  advection  is  the  vertical  component 
of  flow  across  an  isopycnal  smrface  which  occurs  when  miving  produces 
a  divergent  flux  of  buoyancy.  Buoyancy  forcing  of  the  lateral  flow  by 
vortex  stretching  occurs  when  diapycnal  advection  rates  vary  with  depth. 
Microstructure  observations  of  enhanced  turbulence  above  rough  topography 
are  presented.  These  data  allow  us  to  distinguish  the  influence  of  diabatic 
forcing  on  the  circulation.  Estimates  of  diapycnal  advection  are  used  to 
quantify  the  diabatic  flow,  and  the  role  of  diapycnal  advection  as  a  mechanism 
of  vortex  stretching  on  the  circulation  is  assessed.  The  divergence  of  diapycnal 
mass  flux  is  foimd  to  be  a  significant  forcing  mechanism  for  the  circulation 
occurring  above  firactiure  zone  topography.  Moreover,  both  the  direction 
and  magnitude  of  diapycnal  advection  are  dependent  on  the  efficiency  of 
turbulence  at  generating  a  buoyancy  flux.  Depth  variations  of  diapycnal 
advection  suggest  the  mixing  efficiency  is  reduced  in  weakly  stratified  abyssal 
canyons. 


Introduction 

The  sun  heats  the  Earth’s  atmosphere  and  ocean, 
driving  the  winds,  evaporation,  and  rain.  These  act  over 
global  scales  to  input  mechanical  energy  and  variance 
to  the  scalar  fields  of  the  ocean.  Turbulence  acts  at  the 
smallest  scales  of  fiuid  motion,  where  variance  is  dissi¬ 
pated  by  molecidar  processes.  Stirring  is  the  mechanism 
that  acts  at  the  intermediate  range  of  scales,  advecting 
variance  both  in  physical  and  wavenumber  space. 

The  input  of  variance  at  the  largest  scales  is  related 
to  the  mechanical  and  buoyancy  forcing  that  drives  the 
so-called  “large-scale  circulation”.  However,  circula¬ 
tions  can  be  forced  by  mechanisms  occurring  anywhere 
along  the  variance  cascade.  For  example,  Spall  (1994) 
describes  large-scale  abyssal  recirculations  driven  by  the 
eddy  fluxes  of  topographic  waves.  Examples  of  eddy 
driven  mean  flows  are  described  by  Lozier  (1997)  and 
Alves  and  Colin  de  Verdiere  (1999).  Buoyancy  forcing 
may  also  originate  at  any  scale.  In  particular,  turbulent 
mixing  at  the  smallest  scales  of  motion  can  give  rise  to 


divergent  fluxes  of  heat  and  salt.  If  the  buoyancy  forces 
that  result  from  turbulence  are  significant  relative  to 
other  forcing,  a  large-scale  circulation  may  develop  by 
the  direct  influence  of  small-scale  mixing.  Spall  (2000) 
demonstrates  that  mixing  near  large-scale  topography  is 
particularly  effective  at  driving  strong  buoyancy-forced 
circulations. 

Stommel  (1957)  first  proposed  that  the  abyssal  circu¬ 
lation  of  the  ocean  may  be  driven  by  buoyancy  forcing, 
and  a  model  proposed  by  Stommel  and  Arons  (I960) 
described  a  simple  set  of  d3mamics  governing  the  buoy¬ 
ancy  driven  flow.  The  Stommel- Arons  model  is  still 
central  to  our  view  of  the  abyssal  circulation  problem, 
and  the  important  role  of  buoyancy  forcing  in  the  abyss 
is  not  questioned.  However,  the  nature  of  deep  buoy¬ 
ancy  forcing  has  only  recently  come  to  light,  with  ob¬ 
servations  providing  new  insight  into  the  mechanisms 
driving  abyssal  circulations.  Observations  of  turbulence 
across  the  Brazil  Basin  show  that  abyssal  mixing  rates 
are  spatially  nonuniform,  with  enhanced  mixing  occur¬ 
ring  above  regions  of  rough  topography  (Polzin  et  al, 
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1997).  Ledwell  et  al.  (2000)  and  St.  Laurent  et  al. 
(2001)  suggest  that  energy  from  internal  tides  sustains 
the  elevated  turbulence  in  these  regions. 

In  the  report  that  follows,  the  implications  for  buoy¬ 
ancy  forcing  by  small-scale  mixing  are  discussed.  Obser¬ 
vations  from  the  abyssal  Brazil  Basin  described  by  St. 
Laurent  et  al.  (2001)  are  summarized  here.  Estimates 
of  cross-isopycnal  motion,  termed  diapycnaJ  advection, 
are  used  to  quantify  the  diabatic  flow.  The  diapycnal 
advection  serves  as  a  primary  measure  of  buoyancy  forc¬ 
ing. 

Diapycnal  advection 

We  wish  to  consider  the  diabatic  flow  in  the  ocean 
interior  driven  by  a  divergent  buoyancy  flux.  The  equa¬ 
tion  for  buoyancy  (b  =  -{g/po)p')  is 

^  +  u- Vb-)-wN2  = -V  F- (1) 

where  u  =  (u,v)  is  the  lateral  flow,  V  =  {d/dx,d/dy) 
is  the  lateral  differential  operator,  and  (F,  Jt)  are  the 
lateral  and  vertical  components  of  buoyancy  flux.  To 
distinguish  diabatic  from  adiabatic  flow,  it  is  useful  to 
consider  the  buoyancy  budget  along  a  neutral  surface. 
Along  a  neutral  surface  whose  depth  is  described  by 
Zi{x,y,t),  buoyancy  is  subject  to  the  relations 

Vbi  =  Vb+'7ziN^  =  0, 

(2) 

dt  dt  ^  at  ~ 

where  (•)i  denotes  an  isopycnal  function  of  (r,  y,  t),  and 
=  dbfdz.  Additionally,  since  buoyancy  flux  across 
an  isopycnal  occurs  only  by  small-scale  turbulence,  all 
fluxes  may  be  expressed  in  terms  of  a  turbulent  diffu- 
sivity  kp  such  that 


Fi  =  -kpVbi  =  -kpVb  -  kpVziN^  =  0, 
Jb  =  -kpN^. 


(3) 


Thus,  the  equation  for  buoyancy  on  a  neutral  surface 
can  be  expressed  as 


^  _  U .  Vzi)  -  V  •  (kpVziN^) . 

(4) 

■  We  define  the  vertical  advection  quantity  w,  =  w- 
dzi/dt  -  u  •  Vzi  as  the  “diapycnal  advection”.  As  dis¬ 
cussed  by  McDougall  (1987,  1995)  and  Pedlosky  (1996), 
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Figure  1.  Diagram  showing  the  parameters  of  a 
steady-state  density  layer. 


w,  is  the  vertical  component  of  flow  through  (not  nor¬ 
mal  to)  isopycnals,  and  is  the  vertical  motion  that  is 
caused  by  a  divergent  buoyancy  flux.  The  divergent 
buoyancy  flux  is  comprised  of  two  terms;  a  standard 
vertical  divergence  term  -dJbldz,  and  a  lateral  diver¬ 
gence  of  vertical  buoyancy  flux  V  •  (Vz*  Jb) .  It  is  use¬ 
ful  to  consider  the  ratio  of  these  terms,  with  scaling 
dJbjdz  ~  S~^Jb,  and  V  •  (Vz,  Jb)  ~  £~^6Jb  for  charac¬ 
teristic  lateral  and  vertical  scales  {£,  S).  The  ratio  of  the 
vertical  to  lateral  divergence  terms  is  (i/S)^.  In  general, 
we  will  assume  that  the  aspect  ratio  S/t  of  isopycnals 
is  small,  such  that  (e/S)^  »  1,  and  (4)  can  be  approxi¬ 
mated  as 


(5) 

Dynamical  Considerations 

Consider  a  large-scale  geostrophic  and  inviscid  flow. 
The  potential-vorticity  equation  for  flow  in  a  steady- 
state  density  layer  (Fig.  1)  is 

/3u=:^u-Vh+^(w,(z„)-w,(zi)),  (6) 

where  h{x,  y)  is  the  thickness  of  the  density  layer,  and 
{z-ay  zi )  denote  the  upper  and  lower  bounding  isopycnal 
surfaces.  The  above  expression  is  the  density-layered 
version  of  the  Stommel  and  Schott  (1977)  beta-spiral 
equation,  and  is  discussed  by  McDougall  (1988)  and 
Hautala  and  Riser  (1993).  In  (6),  the  advection  of  plan¬ 
etary  vorticity  is  balanced  by  vortex  stretching.  Vortex 
stretching  may  occur  adiabatically  by  flow  along  sloping 
isopycnals,  or  diabatically  by  the  divergence  of  diapyc¬ 
nal  advection.  Diabatic  vortex  stretching  is  related  to 


MIXING  AND  DIAPYCNAL  ADVECTION  IN  THE  OCEAN 


177 


the  divergence  of  buoyancy  flux  by  (5).  During  mixing 
events,  vertical  exchanges  of  buoyancy  are  accompanied 
by  dissipation  of  turbulent  kinetic  energy  (TKE),  and 
the  energy  budget  for  an  eiisemble  of  turbulent  events 
is 


(1  —  Rf  )Ji  -h  J?/  e  =  0,  (7) 

where  e  is  the  dissipation  rate,  and  Rf  is  the  efficiency 
of  the  mixing.  The  TKE  budget  expressed  in  (7)  as¬ 
sumes  steady-state  and  homogeneous  statistics  of  the 
Reynolds  stress  tensor.  This  equation  is  discussed  by 
Osborn  (1980),  and  an  expression  for  the  eddy  diffusiv- 
ity  is  readily  derived  by  substituting  =  —kpN^  into 

(7), 

=  (r^)  (]^)  • 

The  fraction  T  =  Rf  {1  —  Rf)^^  (equivalently  T  = 
Jb/e),  the  ratio  of  buoyancy  flux  to  energy  dissipated, 
is  typically  taken  to  be  <20%  for  stratified  turbulence. 
With  use  of  (5)  and  (7),  the  diapycnal  advection  can  be 
expressed  as 


w,  ^  (9) 

and  the  potential  vorticity  equation  (6)  can  be  ex¬ 
pressed  as 


j3v  =  *  Vh  + 

ft 


dz 


(10) 


where  we  have  taken  {w^{zu)  -  w^{zi))  /h  =  dw^jdz. 
As  a  clear  way  of  demonstrating  the  importance  of 
the  diabatic  forcing  term  in  (10),  it  is  useful  to  consider 
the  following  scaling  relations.  Using  standard  nota¬ 
tion,  we  scale  (x,  z)  ~  (L,  H),  u  ~  U,  UjV'  and  de¬ 
fine  the  deformation  radius  as  jR^  =  NHff.  The  scaling 
for  the  term  V/i  is  taken  from  the  geostrophic  relation 
fU  g^Vh^  where  the  reduced  gravity  is  g'  ~  N^H. 
Additionally,  we  scale  the  energy  dissipation  term  as 
€  fE,  where  E  is  the  scale  of  the  kinetic  energy  as¬ 
sociated  with  the  forcing  for  the  vertical  mixing.  This 
scaling  for  e  is  proposed  on  solely  dimensional  grounds, 
noting  that  a  time  scale  is  relevant  for  many  dy¬ 
namical  regimes.  Application  of  these  scaling  relations 
in  (10)  yields  the  following  nondimensional  equation. 


Vh 

h  ■*'u2az 


(11) 


where  the  lower-case  variables  are  now  dimensionless. 
The  ratio  of  the  diabatic  to  adiabatic  stretching  is 

diabatic  stretching  E 

adiabatic  stretching  U^'  ^  ^ 

The  relative  importance  of  the  diabatic-forcing  term 
will  be  set  by  the  energy  level  of  the  process  controlling 
the  vertical  mixing.  In  the  case  of  dissipating  internal- 
tide  energy,  the  energy  flux  scales  as  In  the  abyss 
where  geostrophic  flow  is  weak,  t3q)ical  geostrophic  ve¬ 
locities  are  1/  ~  5  mm  s”^,  while  tidal  velocities  are 
generally  C/tide  ~  20  mm  s'”^  Thus  a  ratio  of  0(1)  for 
the  diabatic  to  adiabatic  vorticity  forcing  is  easy  to  jus¬ 
tify,  demonstrating  the  strong  dynamical  link  between 
mixing  and  circulation  in  the  abyss. 

Observations  of  turbulence  in  the 
Abyssal  Brazil  Basin 

In  the  period  between  January  1996  and  April  1997, 
two  microstructure  surveys  were  made  in  the  Brazil 
Basin.  The  1996  survey  consisted  of  a  basin-scale  sur¬ 
vey  and  the  initiation  of  a  tracer  dispersion  experiment 
at  a  site  near  the  Mid  Atlantic  Ridge  (MAR).  During 
the  1997  survey,  a  detailed  survey  was  conducted  over  a 
5  X  10^  km^  region  of  rough  topography  near  the  tracer 
release  site.  The  primary  focus  of  the  1996  survey  was 
the  spatial  variability  of  mixing  levels,  and  a  report  on 
the  observed  variability  is  given  by  Polzin  et  al  (1997). 
Results  from  the  tracer  release  experiment  are  described 
by  Ledwell  et  al  (2000). 

The  presentation  here  will  focus  on  the  dissipation 
data  collected  over  the  fracture  zone  (FZ)  topography 
near  the  MAR  (Fig.  2).  All  profiles  consist  of  conven¬ 
tional  hydrographic  variables  (e.g.,  6, 5,  U,  V)  in  addi¬ 
tion  to  microstructure,  and  generally  extend  from  the 
surface  mixed  layer  to  within  20  m  of  the  bottom.  The 
bathymetric  data  shown  in  Fig.  2  were  derived  from 
satellite  measurements  of  the  marine  gravity  field  Smith 
and  Sandwell  (1997).  The  map  clearly  shows  the  net¬ 
work  of  FZs  leading  east  to  the  MAR.  This  system  of 
FZs  is  characterized  by  a  series  of  canyons  boimded 
latitudinally  by  crests  that  rise  up  to  1  km  above  the 
canyon  floors. 

Turbulent  kinetic  energy  dissipation  rates  were  de¬ 
rived  from  observations  of  velocity  microstructure.  Only 
two  components  of  the  strain  tensor  are  measured,  and 
isotropy  is  assumed  to  express  the  dissipation  rate  as 
e  =  (15/4)-  (<  >  -h  <  uf  >).  A  summary  of 

the  observed  deep-dissipation  data  was  made  by  col¬ 
lapsing  the  latitudinal  spread  of  the  profiles  (Fig.  3). 
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Figure  2.  Map  of  the  1997  survey  site  with  the  2  arc-minute  resolution  bathymetric  estimates  of  /it  Smith  and 
Sandwell  (1997).  A  total  of  129  profiles  of  dissipation  rate  observations  (crosses)  were  collected  in  this  region. 


1500 

2000 

2500 

3000 

■i-3500 

Q. 

0) 

■^4000 

4500 

5000 

5500 


-24  -22  -20  -18  -16  -14  -12 

longitude 


Figure  3.  A  zonal  section  of  TKE  dissipation  rate  c.  The  individual  profiles  are  shown  as  columns,  and  dissipation 
rate  observations  were  vertically  averaged  into  100  m-bins.  The  shaded  bathymetry  is  representative  of  canyon 
floors,  while  the  shallower  trace  represents  the  level  of  ridge  crests. 
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To  simplify  the  presentation,  only  the  depth  range 
deeper  than  1500  m  is  shown.  The  dissipation  data 
were  depth  averaged  into  100-m  intervals,  and  indi¬ 
vidual  profiles  are  shown  as  columns  of  binned  dissi¬ 
pation.  Since  the  latitudinal  extent  of  the  survey  has 
been  collapsed,  no  particular  section  of  bathymetry  is 
entirely  appropriate.  Instead,  two  classes  of  representa¬ 
tive  bathymetry  are  shown:  the  bathymetry  of  FZ  crests 
and  the  bathymetry  of  canyon  floors,  as  sampled  from 
the  latitude  range  21®S  to  23®S.  Within  about  500  m 
of  the  bottom,  there  is  an  enhancement  of  dissipation 
by  more  than  two  orders  of  magnitude  over  values  ob¬ 
served  at  mid  depth.  Enhanced  dissipation  generally 
occurs  between  the  level  of  FZ  crests  and  canyon  floors, 
suggesting  the  largest  mixing  rates  occur  in  canyons. 

An  inverse  estimate  of  circulation 

Surveyed  hydrography  was  used  to  divide  the  flow 
regime  into  a  series  of  density  layers.  In  these  layers, 
geostrophic  advection  is  related  to  isobaric  gradients  of 
geopotential  anomaly  {ip)  by 

U  =  z  X  -J-  -f  Uo,  (13) 

where  uq  is  a  reference  level  velocity.  Thus,  the  ad¬ 
vection  of  potential  vorticity  through  a  density  layer  of 
thickness  h{x,y),  as  previously  stated  in  (6),  may  be 
expressed  in  the  form 

-t- (^  -  I)  Vo  -F  i{w»izu)-w4zi)) 

—  -Ur^  -  Vr  —  I j  , 

where  axe  the  components  of  relative  geostrophic 
velocity  given  by  {u  -  uo,v  -  uq)  in  (13).  Thus,  the 
problem  of  determining  the  lateral  flow  u  is  reduced 
to  determining  the  reference  level  velocity,  i.e.,  the  ver¬ 
tical  integration  constants  (uqj^o)  of  a  thermal- wind 
balance. 

In  layers  where  flow  encounters  topography,  the  use 
of  the  geostrophic  balance  is  questionable.  A  modified 
momentum  balance,  such  as  that  of  the  bottom 
layer,  may  be  called  for  in  regions  where  a  density  layer 
comes  within  0(100)  m  of  the  bottom.  Additionally, 
the  difliculty  of  defining  the  geopotential  anomaly  along 
pressure  surfaces  interrupted  by  bathymetry  severely 
limits  the  practical  use  of  (13)  and  (14)  in  deep  lay¬ 
ers.  For  these  reasons,  it  is  necessary  to  employ  ad¬ 
ditional  dynamical  constraints  on  the  flow.  This  is 


Figure  4.  Dissipation  profiles  averaged  according  to 
bathymetric  classification.  Bathymetric  classes  were 
assigned  as  crests,  canyons,  or  slopes.  All  profiles 
are  shown  relative  to  a  reference  dissipation  of  e  = 
1  xl0”^°  W  kg“^.  The  95%  confidence  interval  is  shown 
for  each  100-m  average. 

achieved  through  the  use  of  integrated  advective  bud¬ 
gets  for  mass. 


JfdydzuC'^  +  IJdxdzvll-^^ 

(15) 

+  Ifdxdyw^.\l^  =  0 
and  for  potential  temperature 


f  fdydz 

f  Jdxdz 

+ 

//dxdjz  w.e'l®” 

fJdydzKe^l^-^^ 

+ 

J  J dxdz 

4- 

f  fdxdy  kpQzll'^  • 

(16) 

The  integrated  expressions  (15)  and  (16)  are  bounded 
in  control  volumes  of  meridional  and  zonal  extent  (7,  A), 
and  vertically  bounded  in  a  density  layer  by  the  surfaces 
Zuix,y)  and  zi  {x,y). 

Relations  for  the  diapycnal  diabatic  terms  are  ex¬ 
pressed  using  the  dissipation  rate  with  the  use  of  (8)  and 
(9).  For  use  in  the  inverse  model,  the  mixing  efficiency 
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Figure  5.  Layer- averaged  flow  for  the  density  layer  28.16  <  cTn  <  28.20.  The  gridded  surfaces  show  the  an  =  28.16 
(upper)  and  an  =  28.20  (lower)  isopycnals.  Bottom  bathymetry  was  subsampled  onto  a  0.25°  x  0.25°  Mercator 
grid.  The  largest  vectors  are  (5-6)  mm  s“^. 


parameter  F  is  treated  as  constant,  Fq  =  0.20  ±  0.04. 
This  allows  us  to  specify  the  diapycnal  diffusivity  for 
turbulence  as 


and  the  diapycnal  advection  as 


Fo  ^ 

N^  dz' 


(17) 

(18) 


We  emphasize  that  (17)  and  (18)  are  approximations, 
where  a  constant  mixing  efficiency  has  been  assumed. 
While  oceanic  observations  of  turbulence  from  well 
stratified  regions  suggest  Fq  =  0.2  {Mourn,  1996;  St 
Laurent  and  Schmitt,  1999),  observations  from  fjords 
suggest  F  =  0.05  {Stigebrandt  and  Aure,  1989).  Ad¬ 
ditionally,  laboratory  measurements  of  turbulence  sug¬ 
gest  0.05  <  F  <  0.25  characterizes  the  variation  of  mix¬ 
ing  efficiency  over  a  wide  range  Reynolds,  Richardson, 
and  Proude  numbers  {Ivey  and  Imberger,  1991;  Huq  and 
Britter,  1995).  In  our  inverse  model  estimates,  we  ac¬ 
cept  that  using  Fq  =  0.2  zb  0.04  may  lead  to  a  specifi¬ 
cation  of  kp  that  is  biased  high.  The  use  of  a  constant 
mixing  efficiency  in  (18)  is  more  problematic,  as  varia¬ 
tions  in  F  with  depth  will  not  only  influence  the  mag¬ 
nitude  of  the  estimate,  but  also  the  sign  {w^  >  0 


for  upwelling,  <  0  for  downwelling) .  For  this  rea¬ 
son,  is  regarded  as  an  unknown  in  the  inverse  model, 
and  (18)  serves  as  a  “constraint”  in  the  inversion.  In 
this  manner,  (18)  provides  an  a  priori  estimate,  while 
full  inversion  of  (14),  (15),  and  (16)  determines  the  best 
estimate  of  ly* . 

In  expressions  (17)  and  (18),  e  is  meant  to  denote 
some  suitably  averaged  function  of  the  dissipation  data. 
Averaging  over  multiple  profiles  is  necessary  to  achieve 
statistical  stability  in  the  mean  dissipation  estimates. 
Since  our  inverse  model  utilizes  the  steady-state  forms 
of  the  advective  budgets,  we  seek  an  averaging  proce¬ 
dure  that  yields  an  estimate  of  the  time-mean  dissipa¬ 
tion  rate.  Thus,  the  dissipation  data  were  averaged  in 
both  space  and  time.  The  time  averaging  was  weighted 
by  a  function  of  the  squared  barotropic  tidal-current 
speed  to  remove  biases  associated  with  a  spring-neap 
modulation.  Furthermore,  dissipation  data  were  spa¬ 
tially  averaged  according  to  a  bathymetric  classification 
scheme.  Dissipation  profiles  above  FZ  canyons,  crests, 
and  the  sloping  topography  between,  were  treated  sep¬ 
arately  in  the  averaging.  The  profile  data,  classified  in 
this  manner,  are  shown  in  Fig.  4.  Each  ensemble  profile 
results  from  data  that  has  been  vertically  averaged  into 
100-m  bins.  Dissipation  decreases  with  height  above 
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Figure  6.  Diapycnal  advection  estimates  for  the  =  28.16  (upper  panel)  and  <7„  =  28.20  (lower  panel)  isopycnals. 
The  lateral  flow  for  this  density  layer  (from  Fig.  5)  is  shown  in  the  middle  panel.  Contoured  estimates  of  diapycnal 
advection  have  been  averaged  over  1  °  x  1°  Mercator  cells  which  include  regions  blocked  by  topography. 
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bottom  in  each  of  the  3  classes,  and  all  profiles  reach 
background  levels  of  dissipation,  e  =  3  x  10"^®  W  kg“^ 
at  heights  greater  than  1000  m.  The  slope  profile 
is  notably  different  from  the  other  two,  in  both  the 
magnitude  of  the  maximum  dissipation  and  the  scale 
of  decay.  The  slope  profile  has  a  bottom  value  of 
€  =  (3  —  9)  X  10“^  W  kg“^  and  decays  with  an  e-fold 
scale  of  (150  ±  50)  m.  The  crest  and  canyon  profiles 
peak  at  €  =  (2  —  5)  x  10“®  W  kg“^  and  decay  over  a 
larger  scale,  (500  it  100)  m. 

A  matrix  model  for  a  6  density  layer  system  con¬ 
sisting  of  about  7000  equations  for  3200  unknowns  was 
formed  using  (14),  (15),  (16),  (17),  and  (18).  The  de¬ 
tails  of  the  linear  inversion  are  given  by  St  Laurent 
et  al  (2001).  In  a  series  of  initial  calculations,  lateral 
diffusivity  /c  was  treated  as  constant  along  each  den¬ 
sity  surface  used  in  the  inversion.  Using  this  approach, 
the  estimates  of  k  were  inconsistent  with  the  a  priori 
estimates  of  «  ~  100  m^  s“^  derived  from  float  obser¬ 
vations  {Hogg  and  Owens^  1999).  Spatial  variations  of 
K  on  isopycnals  intersecting  topography  may  account 
for  this  discrepancy.  Additionally,  diagnostic  scaling 
with  K  ^  100  m^  s”^  indicated  that  a  primary  thermal- 
advection  balance  (16)  occurs  between  the  terms  in¬ 
volving  advection  (both  lateral  and  vertical)  and  the 
vertical  diffusion  term.  Scale  estimates  suggest  that 
the  magnitude  of  the  lateral  diffusive  flux  terms  in  (16) 
are  less  than  10%  of  the  diapycnal  diffusive  flux  of  po¬ 
tential  temperature.  For  these  reasons,  terms  involving 
the  lateral  diflFusivity  were  dropped  from  the  thermal 
advection  equations,  leaving  the  three  dimensional  flow 
field  (u,  w*)  as  the  model  unknowns. 


Flow  in  and  above  abyssal  canyons 

Here,  we  present  the  estimated  circulation  in  and 
above  the  FZ  canyons.  A  more  complete  discussion  of 
the  circulation  at  all  depths  in  this  region  of  the  Brazil 
Basin  is  given  by  St  Laurent  et  al  (2001).  In  the  layers 
deeper  than  3000  m,  bottom  bathymetry  has  a  clear  in¬ 
fluence  on  the  flow  dynamics.  In  particular,  proximity 
to  the  bottom  is  the  controlling  factor  for  the  magni¬ 
tude  of  the  diabatic  forcing.  The  vertical  diffusivity 
characterizing  these  layers  is  ~  1  x  10“^  m^  s“^ 
or  greater,  and  this  corresponds  to  an  increased  im¬ 
portance  of  the  diabatic  terms  in  the  governing  equa¬ 
tions.  Flow  was  estimated  in  the  neutral  density  layer 
28.16  <  an  <  28.20  (Fig.  5).  The  upper  isopycnal  for 
this  layer  {an  =  28.16)  rests  just  above  the  level  of  the 
FZ  crests  at  16® W.  Near  the  MAR,  there  is  westward 
flow  out  of  canyons  with  average  current  magnitudes  of 
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(2-4)  mm  s  Further  west,  the  density  layer’s  average 
height  above  bottom  increases,  and  flow  at  500  m  above 
the  level  of  FZ  crests  is  southward  with  average  current 
magnitudes  of  (5-6)  mm  s“^. 

Diapycnal  advection  estimates  are  presented  in  Fig.  6. 
These  contoured  fields  of  have  been  highly  smoothed. 
Where  FZ  crest  topography  intersects  each  density  sur¬ 
face,  there  is  no  flow,  and  the  contoured  maps  show 
estimates  of  that  have  been  averaged  in  1°  x  1®  cells 
which  often  include  regions  blocked  by  topography.  The 
an  =  28.16  and  an  =  28.20  isopycnals  are  characterized 
by  diapycnal  downwelling  to  the  west  and  diapycnal  up- 
welling  to  the  east.  Estimates  of  upward  advection  in 
canyons  exceed  =  30  m  and  are  as  large  as 

=  100  m  yr“^  in  some  local  regions.  Uncertainty 
estimates  for  the  diapycnal  flow  are  typically  less  than 
±40%  of  the  contoured  values. 

To  produce  a  clearer  summary  of  the  deep  circulation 
in  and  above  canyons,  a  zonal  stream  function  {xp)  was 
defined  by  latitudinally  integrating  the  velocity  so  that 
uSy  =  -xpz  and  uu5y  =  where  the  latitudinal  limits 
of  integration  were  taken  over  the  inverse  domain  (19  ®S 
-  25®S), 


u{x,  z)  =  f  dyu{x,y,z), 

^(x,z)  =  J  dyw.(x,y,z). 


(19) 


An  objective  analysis  technique  was  used  to  interpo¬ 
late  the  inverse  solution  and  compute  the  stream  func¬ 
tion.  The  resulting  smoothed  stream-function  is  shown 
in  Fig.  7  relative  to  representative  bathymetry  and  the 
density  field.  These  estimates  suggest  eastward  flow 
in  canyons.  Diapycnal  upwelling  occurs  at  sites  where 
isopycnals  in-crop  along  canyon  floors,  and  this  up- 
welling  feeds  the  westward  flow  at  the  level  of  the  FZ 
crests. 


Deep  upwelling  and  mixing  efficiency 

As  presented  above,  strong  diapycnal  upwelling  is  re¬ 
quired  to  close  the  heat  and  mass  budgets  in  regions 
where  density  surfaces  intersect  canyon  floors.  Figure  8 
presents  a  comparison  of  the  a  priori  estimates  of  tu, 
to  the  inverse  solutions  for  w.  on  the  o-„  =  28.20  neu¬ 
tral  surface.  Inverse  estimates  are  generally  consistent 
with  the  a  priori  estimates  of  w.  at  sites  where  the 
On  =  28.20  surface  is  more  than  300  m  above  the  bot¬ 
tom.  However,  the  inverse  estimates  suggest  diapycnzil 
upwelling  is  favored  at  sites  where  the  cr„  =  28.20  sur¬ 
face  is  within  300  m  of  the  bottom. 
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Figure  7.  The  meridionally  integrated  stream  function  as  estimated  through  objective  analysis  of  the  inverse 
model  solution.  Dimensional  values  of  the  stream  function  are  contoured,  and  uncertainties  are  roughly  ±40%. 
Representative  bathymetry  of  the  canyon  floors  (shaded)  and  crests  (gray  line)  is  shown,  as  are  the  an  ==  28.10,  28,16 
and  28.20  isopycnals. 


Diapycnal  upwelling  must  be  occurring  in  regions 
where  turbulence  supports  a  convergent  buoyancy  flux: 
w^N^  =  —dJhjdz  >  0.  This  condition  is  satisfled  by 
turbulence  occurring  close  to  the  bottom,  since  the 
buoyancy  flux  must  decrease  with  depth  to  meet  a  no¬ 
flux  condition  at  the  sea  floor.  Within  the  context  of 
(7),  the  buoyancy  flux  is  related  to  the  dissipation  rate 
by  a  mixing  efficiency  parameter,  J5  =  — Fe.  We  believe 
that  a  reduction  in  mixing  efficiency  near  the  bottom 
allows  the  no-flux  condition  to  be  met  at  the  sea  floor. 
This  reduction  may  occur  in  a  1  to  10  m  thick  bound¬ 
ary  layer  (i.e.,  the  “log  layer”),  which  was  unresolved 
by  our  measurements.  However,  variations  in  mixing 
efficiency  may  extend  to  greater  heights  in  the  canyons. 
The  physical  mechanisms  controlling  the  efficiency  of 
mixing  near  topography  are  not  well  understood,  but 
turbulent  mixing  in  regions  enclosed  by  topography  may 
favor  mixing  with  reduced  efficiency.  Stigebrandt  and 
Aure  (1989)  found  that  mixing  in  fjords  occurred  at  0.05 
efficiency.  Given  our  observations  of  dissipation  and 
stratification  along  canyon  slopes,  a  decrease  in  mixing 
efficiency  from  0.2  to  0.05  in  the  bottom  most  100  m 
above  FZ  slopes  would  account  for  diapycnal  upwelling 
as  large  as  u;*  =  ±300  m  yr“^. 


Discussion 

We  have  described  buoyancy  forcing  and  circulation 
occurring  in  a  region  of  the  abyssal  ocean.  Our  cal¬ 
culations  suggest  that  enhanced  levels  of  turbulence 
above  rough  bathymetry  lead  to  enhanced  levels  of  di- 
abatic  flow.  Divergence  of  the  diabatic  flow  acts  as  a 
mechanism  of  buoyancy  forcing  for  the  lateral  circula¬ 
tion  though  the  vortex  stretching  term  {f /h){w^{zu)  — 
w^{zi))  in  the  vorticity  budget.  The  significance  of  di¬ 
abatic  vortex  stretching  as  a  forcing  mechanism  must 
be  assessed  relative  to  other  forcing  agents  for  the 
flow.  From  the  inverse  estimates,  we  may  compare  the 
strength  of  buoyancy  forcing  to  the  adiabatic  mecha¬ 
nism  of  vortex  stretching,  represented  by  {f  /h)u  •  V/i 
in  the  vorticity  budget. 

Above  the  level  of  FZ  crests,  adiabatic  vortex  stretch¬ 
ing  is  cyclonic,  while  diabatic  vortex  stretching  is  anti- 
cyclonic.  In  the  region  where  mixing  rates  are  largest, 
these  two  mechanisms  are  comparable  in  magnitude  and 
the  vortex  stretching  terms  in  (6)  nearly  cancel.  Since 
Pv  =  0,  there  is  little  meridional  flow  just  above  the 
level  of  FZ  crests  (Fig.  5).  The  divergence  of  diapycnal 
mass  flux  is  clearly  a  signiflcant  forcing  agent  on  the 
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Figure  8.  Comparison  of  the  a  priori  estimates  of  li;*  with  the  inverse  estimates  for  the  cr^  =  28.20  neutral  density 
surface.  Estimates  at  sites  where  the  an  =  28.20  surface  is  more  than  300  m  above  the  bottom  are  plotted  using 
triangles,  and  estimates  from  sites  where  an  =  28.20  is  less  than  300  m  above  the  bottom  are  plotted  as  points. 
The  curve  along  which  the  inverse  and  a  priori  estimates  are  equal  is  shown.  Standard  errors  are  shown  for  all 
estimates. 


lateral  circulation  in  this  region.  Further  west  of  the 
Mid  Atlantic  Ridge  where  mixing  rates  are  weaker,  the 
adiabatic  stretching  exceeds  the  diabatic  stretching,  re¬ 
sulting  in  the  southward  flow  where  the  largest  lateral 
velocities  occur.  In  addition  to  the  stretching  mecha¬ 
nisms,  friction  imposed  by  topography  may  also  be  a 
significant  forcing  mechanism  near  the  bottom,  though 
we  have  not  attempted  to  assess  this  contribution. 

The  diapycnal  advection  estimates  presented  here 
are  clearly  sensitive  to  assumptions  made  about  tur¬ 
bulent  mixing  efficiency.  As  indicated  by  (9),  depth 
variations  in  mixing  efficiency  will  influence  both  the 
magnitude  and  direction  of  the  diabatic  mass  flux.  We 
note  that  most  data  are  consistent  with  a  mixing  ef¬ 
ficiency  Rf  near  15%  (equivalently,  F  =  0.20),  For 
example,  diffusivity  estimates  from  (17),  which  assume 
r  =  0.2  ±  0,04  agree  with  the  turbulent  diffusivity  es¬ 
timated  from  the  tracer  dispersion  experiment  {Ledwell 
et  al,  2000).  Moreover,  St  Laurent  et  al  (2001)  find 
general  agreement  between  a  priori  diapycnal  advection 
estimates  estimated  with  F  =  0.20  in  (18)  and  diapyc¬ 
nal  advection  estimates  deduced  from  inversion  of  the 
steady-state  budgets.  Variations  in  mixing  efficiency 


are  only  inferred  from  estimates  deep  in  FZ  canyons,  in 
regions  where  isopycnal  slopes  increase  as  density  lay¬ 
ers  in-crop  along  the  bottom.  Our  evidence  for  reduced 
mixing  efficiencies  is  indirect,  but  the  diapycnal  ad¬ 
vection  estimates  are  consistent  with  a  reduction  from 
F  =  0.20  to  F  =  0.05  over  a  depth  of  100  m  above  the 
sloping  walls  of  FZ  canyons. 

While  the  study  strictly  considered  data  from  a  small 
geographic  region,  the  bathymetric  features  present  are 
ubiquitous  over  the  basins  of  the  global  ocean,  as  are 
the  tides.  If,  as  suggested  by  Munk  and  Wunsch  (1998), 
the  tides  are  the  dominant  source  of  mechanical  energy 
for  abyssal  mixing,  the  pattern  of  enhanced  turbulence 
observed  over  Brazil  Basin  fracture  zones  may  be  repre¬ 
sentative  of  turbulence  occurring  in  other  regions.  Dia¬ 
batic  forcing  above  rough  topography  may  be  a  primary 
forcing  mechanism  for  the  abyssal  circulation. 
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Abstract.  Double  diffusive  phenomena,  due  to  the  difference  in  molecular  con¬ 
duction  coefficients  for  heat  and  for  ion  concentrations  in  sea  water,  are  widely 
known.  These  include  salt  fingering  and  diffusive  convection  which  occur  when 
either  temperature  or  salinity  is  destabilizing  while  the  other  (salinity  or  heat)  is 
stabilizing.  Less  well  known,  less  dramatic  in  their  occurrence,  but  far  more 
widespread  are  circumstances  where  both  temperature  and  salinity  are  stabilizing 
and  yet  the  differences  in  molecular  conduction  lead  to  systematic  differences  in 
turbulent  transport  rates.  Differential  diffusion”  has  been  seen  in  laboratory  ex¬ 
periments,  in  ocean  observations  and  in  numerical  simulations.  Here  we  examine 
differential  diffusion  in  terms  of  turbulence-induced  restratifying  (up-gradient) 
fluxes. 


Introduction 

Double  diffusion  occurs  in  the  ocean  because  heat  and 
salt  (ion  concentration)  experience  very  different  coeffi¬ 
cients  of  molecular  conduction.  Heat  conduction  has  about 
100  X  larger  coefficient  than  ionic  conduction.  As  both 
heat  and  salt  determine  density,  circumstances  arise  where 
the  potential  density  profile  p(z)  is  stable  although  the  con¬ 
tribution  from  either  temperature  T(z)  or  salinity  S{z) 
would,  by  itself,  be  gravitationally  unstable.  Many  studies 
and  a  large  literature  have  addressed  these  doubly  diffusive 
phenomena,  as  Kelley  (2001)  has  reviewed  at  this  work¬ 
shop. 

Over  the  major  part  of  the  ocean  interior,  both  T(z)  and 
^(z)  are  stabilizing.  While  lateral  intrusive  instabilities 
may  occur,  most  double  diffusive  phenomena  are  not  al¬ 
lowed.  The  prevailing  view  is  that  both  f  and  5  are  simply 
mixed  by  occasional  background  turbulence.  Turbulent 
diffusivities  for  T  and  for  5  are  assumed  to  be  equal. 
However  prevalent  this  view,  it  may  not  be  quite  so. 

Do  the  differences  of  molecular  conduction  cause  dif¬ 
ferences  for  the  turbulent  (advective  part)  fluxes?  Labora¬ 
tory  evidence  and  numerical  experiments  show  that  they 
do.  In  early  experiments.  Turner  (1968)  mechanically 
agitated  a  fluid  stratified  ivith  respect  to  T  and,  separately, 
a  fluid  stratified  with  respect  to  S,  taking  care  that  the  two 
stratifications  were  initially  the  same.  Under  the  same 
mechanical  agitation,  it  was  found  that  T  was  mixed  more 
efficiently  than  5"  by  an  amount  greater  than  could  be  at¬ 
tributed  to  molecular  conduction.  Later,  Altman  and  Gar¬ 
gett  (1980)  performed  similar  experiments  in  which  a  tank 


was  initially  stably  stratified  with  respect  to  both  T  and  S, 
arranged  so  that  both  made  the  same  initial  contribution  to 
stratification.  Again  it  was  seen  that  T  mixed  more  readily 
than  S  by  amounts  exceeding  molecular  conduction.  Direct 
observation  of  differential  diffusion  in  the  ocean  is  hugely 
technically  difficult,  and  first  results  are  just  now  being 
reported  by  Nash  and  Mourn  (2001). 

Attempting  to  quantify  differential  diffusion,  one  may 
employ  direct  numerical  simulations.  A  major  challenge 
to  this  approach  is  the  necessity  to  resolve  widely  disparate 
scales,  with  the  dissipation  scale  for  S  nearly  100  times 
smaller  than  the  dissipation  scale  for  T,  itself  already  10 
times  smaller  than  the  dissipation  scale  for  the  advecting 
velocity  field.  A  limited  investigation  based  upon  numeri¬ 
cal  simulations  was  performed  by  Merryfield  et  al.  (1998) 
employing  the  idealization  that  all  fields  were  independent 
of  one  horizontal  coordinate,  i.e.,  two-dimensionalizing  the 
problem.  By  not  resolving  three  dimensions,  more  com¬ 
puting  resource  could  be  applied  to  wider  range  of  scales 
in  two  dimensions. 

Forcing  with  random  velocity  fields,  Merryfield  et  al. 
found  results  consistent  with  laboratory  observations, 
namely  that  turbulent  fluxes  (not  including  explicit  diffu¬ 
sive  fluxes  driven  by  background  gradients)  transferred 
heat  more  rapidly  than  salt.  Differential  transfer  rates  were 
explored  as  dependent  upon  flow  parameters.  However, 
these  results  were  subject  to  doubt  because  the  two- 
dimensionalization  very  much  alters  the  nature  of  “turbu¬ 
lence  ,  trapping  kinetic  energy  in  large,  non-dissipating 
scales. 

As  far  as  computing  resources  have  allowed,  we  have 
now  performed  a  suite  of  experiments  in  fully  three  dimen- 
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sions.  In  this  note  we  report  preliminary  results  from  these 
experiments  and  a  schematic  explanation  to  account  for  the 
occurrence  of  differential  diffusion. 

Direct  simulations  in  3D 

We  solve  the  Boussinesq  equations  without  Earth’s  ro¬ 
tation: 

|^  +  (u.V)u  =  -— Vp  +  -^g+vV=u  ... 

3/  Po  Po  (1) 


We  then  randomly  select  an  isotropic  initial  u  with  en¬ 
ergy  spectrum  approximately  while  initial  7"' and  S'  are 
nil.  Amplitude  of  the  initial  u  are  adjusted  for  experiments 
in  which  a  turbulent  Froude  number  takes  values  from  0.4 
(“weak”)  to  48  (“strong”).  During  the  subsequent  decay, 
we  diagnose  vertical  heat  and  salt  fluxes,  vi^T'and 
evaluating  their  cross-spectra  at  selected  times  and  evaluat¬ 
ing  time-integrated  fluxes.  A  more  complete  account  will 
be  given  by  Gargett  et  al.  (2001).  Here  we  sketch  only 
key  results  and  an  explanation. 


V*u  =  0 


dt 

df 


+  (u.V)P  =  k’.,V'7’ 
+  (yV)5’  =  K',V'5 


(2) 

(3) 

(4) 


where  u=(f/,v,vr)  is  velocity,  T  is  temperature,  S  is  salin¬ 
ity,  p  is  density,  p  is  pressure,  g  is  gravity,  v  is  viscosity, 
Ky  and  Ks  are  coefficients  of  explicit  diffusion,  and  we 
assume  a  linear  equation  of  state 


p^p„[\-a{T-T,)+p{S-S„)] 


(5) 


T  and  S  are  expanded  about  a  prescribed  (non- varying) 
background 


such  that 


7’(.vo',z)=^,.^+^'(.v■o^z) 
S  ( A',  y ,  r )  =  VyZ  +  S'  (  a-,  y,  z ) 

/?^,=ar,//3r,  =  ^i 


(6) 

(7) 


Results 

First,  we’ve  compared  output  with  previous  2D  experi¬ 
ments  reported  by  Merry  field  et  al.  (1998).  For  this  com¬ 
parison,  we  have  rerun  the  earlier  2D  simulations  but  with 
T^O.l,  consistent  with  present  3D  runs.  Results  (not 
shown  here)  are  pleasantly  reassuring,  tending  to  confirm 
the  value  of  earlier  2D  simulations  and  suggesting  that  our 
3D  T=0.1  results  can  be  extrapolated  to  T=0.01  by  analogy 
with  2D  results.  For  the  purpose  of  this  report,  we  show 
only  a  representative  case  from  3D  simulations.  Figures  1 
and  2  show  evolution  in  time  of  several  spectral  bands 
from  cross-spectra  -w'T' and  w'S'. 


Equations  (1)  through  (4)  are  solved  pseudo-spectrally, 
with  dealiasing,  using  filtered  leapfrog  timestepping.  Vis¬ 
cous  and  diffusive  terms  are  represented  by  exponential 
integrating  factors  in  Fourier  space.  The  great  difficulty, 
even  in  our  age  of  modern  computing,  is  that  the  requisite 
range  of  scales  one  should  wish  to  resolve  in  3-space  ex¬ 
ceeds  computational  resource.  At  our  highest  resolution, 
we  employ  160  gridpoints  in  each  dimension  for  velocity 
and  temperature  fields  while  employing  twice  that  (320 
gridpoints)  for  salinity.  Even  so,  for  v  =  0.01  cm2/s,  kT 
=0.0014  cm2/s  and  IT  =10-4  °C/cm,  we  are  compelled  to 
work  in  a  cube  approximately  17  cm  tall  with  grid  spacing 
about  0.1  cm  for  u  and  T  and  0.05  cm  for  S.  For  salinity 
dissipation  scales  to  be  resolved,  we  must  limit  t=kS/kT  to 
0.1  where  a  more  realistic  value  would  be  near  0.01 . 


Figure  1.  Timeseries  of  -vt'T'  are  shown  for  several  vertical 
wavenumber  bands  from  red=Iow  kz  to  bliie=high  kz.  We  show 
w'T'  so  that  positive  values  represent  downward  flux,  hence 
downgradient  in  T  (usual  “mixing”).  A  half-buoyancy  period, 
n/N,  occurs  at  about  /=0.8  Arrows  mark  times  displayed  in  Fig¬ 
ure  3. 


Why? 

Numerical  simulations,  here  in  3D  and  previously  in  2D 
(Menyfield  et  a/., 1998),  are  consistent  with  the  sense  of 
laboratory  observations.  The  less  diffusive  scalar  is  trans¬ 
ported  by  velocity  fluctuations  less  effectively  than  the 
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Figure  2.  Timeseries  of  +vi>'5''are  shown  as  in  Figure  1 ,  plotted  so 
that  positive  is  upward  flux,  hence  downgradient  in  S.  To  aid 
comparison,  fluxes  in  each  spectral  band  have  been  rescaled  just 
after  initiation.  While  rescaling  enables  the  higher  wavenumber 
bands  to  be  displayed,  one  is  cautioned  that  this  enhances  percep¬ 
tion  of  bands  dominated  by  restratifying  fluxes. 

more  diffusive  scalar.  3D  experiments  were  performed 
over  a  range  of  Froude  numbers  0.4  to  48  and  turbulent 
Reynolds  numbers  1  to  80,  all  with  t=0.1  (not  0.01).  In 
the  higher  Fr  and  Re  cases,  we  obtain  flux  ratios  from  0.7 
to  0.9.  (At  lower  Fr  and  Re,  flux  ratios  can  be  0.2  and  less 
approaching  the  theoretical  limit  t=0.1  .  However,  in  these 
low  amplitude  cases,  the  velocity  fluctuation  induced 
transports  are  smaller  than  direct  conduction.)  Taking 
account  that  active  oceanic  turbulence  typically  occurs 
with  Re  greater  than  we  are  able  to  realize  in  numerical 
simulations,  and  that  actual  salt-heat  have  nearer  0.01,  our 
results  are  best  described  as  not  inconsistent  with  the  ob¬ 
servational  inference  from  Nash  and  Mourn  who  estimate 
flux  ratio  0.7  ±  20%.  A  more  complete  report  will  be  given 
by  Gargett  et  ai  (A.E.  Gargett,  W.J.  Merryfield,  and  G. 
Holloway,  Differential  diffusion  of  T  and  S:  Three- 
dimensional  numerical  simulations,  in  preparation). 

We  pause  to  ask  why  differential  diffusion  happens  at 
all,  and  why  it  has  the  sense  it  has.  One  might  have 
thought  ab  initio  that  the  sense  of  differential  diffusion 
would  be  opposite  to  that  observed.  If  we  thought  of 
“usual”  turbulence  transporting  a  “usual”  (passive)  scalar, 
say  C,  then  flux  cross-spectra  w'C'  should  remain  of 
downgradient  sense  at  all  scales.  Diffusivity  of  the  scalar 
would  cut  off  the  high  wavenumber  range  of  w'C'  and 
thus  a  more  diffusive  scalar  would  feel  less  turbulent 
transport.  Why  is  this  not  also  the  case  for  active  (buoy¬ 
ant)  tracer  in  stratified  flow? 

The  answer  is  seen  in  Figures  1,  2  and  3  in  the  preva¬ 
lence  of  upgradient  (restratifying)  fluxes.  Are  these  fluxes 
Figure  3.  Cross  spectra  of  ~  wT^  (thick  curves)  and  w'S'  (thin 
curves)  are  plotted  versus  kz  at  the  three  times  marked  in  Figures 


1  and  2  with  blue==early,  gre  en=mid,  red=late.  Solid  curves  indi¬ 
cate  downgradient  (“mixing”)  fluxes  while  dashed  curves  repre¬ 
sent  upgradient  (“restratifying”)  fluxes. 


not  also  the  fluxes  reported  by  Carnevale  (this  proceed¬ 
ings)?  If  indeed  restratifying  fluxes  are  generic,  and  the 
restratifying  sense  dominates  the  higher  wavenumber 
range,  then  diffusive  suppression  of  higher  wavenumber 
cross-spectra  yields  enhanced  transprt  of  the  more  diffu¬ 
sive  tracer  simply  because  the  more  diffusive  tracer  ex¬ 
periences  less  restratification.  In  this  case  the  question 
would  move  from  “Why  differential  diffusion?”  to  “Why 
restratifying  fluxes?”  It  is  a  question  from  ‘Aha  Huliko’a 
since  more  than  a  decade  ago  {Holloway^  1 989),  yet  unre¬ 
solved. 

Conclusions 

Until  a  far  more  confident  dynamical  basis  is  estab¬ 
lished,  we  are  not  able  to  propose  a  parameterization 
scheme  for  differential  diffusion  in  the  bistable  environ¬ 
ment.  What  we  show  is  that  numerical  simulations,  both 
in  2D  and  here  in  3D,  confirm  laboratory  and  oceanic  ob¬ 
servations  that  heat  and  salt  do  not  mix  equally  efficiently. 
This  is  contrary  to  modelling  practice  for  which  the  mixing 
coefficients  for  heat  and  salt  are  equal  except  in  rare  in¬ 
stances  where  a  double-diffusive  parameterization  has 
been  considered.  For  the  large  fraction  of  world’s  oceans 
where  the  environment  is  stable  to  double  diffusion,  the 
assumption  of  equal  mixing  of  heat  and  salt  is  wrong. 
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Six  questions  about  double-diffusive  convection 

Dan  E.  Kelley 

Dalhousie  University,  Halifax,  Nova  Scotia,  Canada 


Abstract.  Double-diiffusive  convection,  at  first  considered  an  “oceanographic 
curiosity,”  has  fascinated  fiuid  dynamicists  for  a  generation.  This  is  partly 
because  some  early  and  basic  questions  have  still  not  been  answered.  The 
most  practical  of  these  -  whether  double  diffusion  is  important  in  the  ocean 
-  was  raised  in  the  defining  paper,  when  Stem  (1960)  wrote  ''future  studies 
of  this  model ...  will  determine  whether  the  proposed  mechanism  is  significant 
in  the  vertical  mixing  of  the  sea''  This  question  of  significance  has  not  been 
answered  yet,  and  may  not  be  xmtil  we  can  answer  a  host  of  fundamental 
questions  about  mechanisms  and  interactions,  some  of  which  are  highlighted 
here. 


Introduction 

Double-diffusive  phenomena^  occur  in  diverse  sys¬ 
tems,  ranging  from  stars  to  magma  chambers,  with 
oceans  in  between.  Studies  of  cirrus  clouds  almost  led  to 
the  discovery  of  the  mechanism  of  double  diffusion  (DD 
henceforth^)  in  the  1800s  {Schmitt,  1995),  but  it  was  a 
group  of  oceanographers  who  eventually  made  the  dis¬ 
covery  a  full  century  later.  While  pondering  whether 
they  could  measure  deep  ocean  pressures  by  lowering 
pipes  from  the  surface,  the  oceanographers  conceived  of 
a  perpetual  salt  fountain  {Stommel  et  al,  1956),  a  fan¬ 
ciful  idea  that  soon  developed  into  a  theory  of  SF  and  of 
DD  convection  in  general  {Stem,  1960).  An  analogy  be¬ 
tween  convecting  layers  created  in  the  laboratory  and 
layers  newly  observed  in  the  ocean  {Stommel  and  Fe¬ 
dorov,  1967;  Tait  and  Howe,  1968)  suggested  that  DD 
was  significant  to  the  ocean,  and  this  was  supported 
by  estimates  of  large  oceanic  DD  fluxes,  inferred  using 
laboratory-based  flux  laws  {Turner,  1965, 1967).  In  ad¬ 
dition  to  this  work  on  the  case  with  backgroxmd  T  and  5 
fields^  varying  vertically,  attention  was  paid  to  the  case 
of  horizontal  variation,  as  in  the  theory  that  divergent 


will  assume  readers  are  familiar  with  double  diffusion  in  the 
ocean;  see  recent  reviews  by  Schmitt  (1994)  and  Fernando  and 
Brandt  (1994). 

^Abbreviations:  “DD”  for  double  diffusion;  ‘WS”  for  rela¬ 
tively  warm  and  salty;  “CF”  for  relatively  cold  and  fresh;  “SF” 
for  the  salt-finger  mode  of  DD,  possible  when  WS  water  overlays 
CF  water;  and  “DC”  for  the  reverse  case  of  diffusive  convection. 

^Notation:  T  for  temperature;  S  for  salinity;  a  for  thermal 
expansion  coefficient;  p  for  haline  contraction  coefficient;  «  for 
thermal  diffusivity;  ks  for  haline  diffusivity;  i/  for  kinematic  vis¬ 
cosity;  z  for  upward  coordinate;  Rp  =  {otdT/dz)/{PdS/dz)  for 
the  density  ratio  in  the  SF  case  or  its  reciprocal  for  the  DC  case,  or 
either  in  finite-difference  form;  Nu  for  Nusselt  number,  a  nondi- 
mensional  heat  flux;  Ra  for  Rayleigh  number,  etc.,  as  in  Tamer 
(1973). 
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Figure  1.  Definition  sketch  of  background  gradients  in  the 
linear  instability  view  of  “staircase”  double  diffusion  (SF  in 
left  panel,  DC  in  right  panel),  with  salinity  and  temperature 
gradients  aligned  parallel  in  the  vertical. 


DD  fluxes  might  drive  interleaving  across  thermohaline 
fronts  {Stem,  1967). 

The  dates  in  the  last  paragraph  suggest  that  the 
key  modem  ideas  about  oceanic  DD  were  developed  in 
little  more  than  a  decade.  However,  even  that  may 
be  an  overestimate.  Stommel  (1995)  summarized  the 
thought  progression  from  salt  foimtains  to  salt  fingers 
imder  the  remarkable  heading  “Exciting  Ten  Minutes 
at  the  Blackboard.”  But,  whether  it  took  ten  minutes 
or  ten  years,  one  thing  is  clear:  the  central  ideas  of  DD 
research  came  to  light  in  a  short  time.  It  might  also  be 
noted  that  these  ideas  were  set  out  a  full  generation  ago. 
A  student  could  well  ask  whether  there  are  significant 
research  questions  left  for  the  next  generation.  I  think 
there  are,  and  I  plan  to  outline  some  of  them  here,  in 
hopes  of  encouraging  discussion  and  future  work. 
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Ql.  Where  does  DD  exist  in  the  ocean? 

Staircases.  Instability  theory  suggests  that  DD  con¬ 
vection  may  occur  if  large-scale  gradients  of  S  and  T 
are  oriented  vertically  in  the  same  direction  (Figure  1). 
This  sets  a  very  wide  domain  indeed.  For  example,  In¬ 
gham  (1966)  estimated  that  90%  of  the  Atlantic  Ocean 
main  thermocline  has  SF-unstable  stratification.  Fur¬ 
thermore,  stratification  at  high  latitudes  is  very  com¬ 
monly  DC-unstable. 

A  possible  test  of  whether  DD  is  significant  in  a 
given  region  may  be  whether  regular-shaped  thermoha¬ 
line  staircases  are  observed  there.  This  rests  on  the  as¬ 
sumption  that  regular-shaped  staircases  can  result  only 
if  up-gradient  DD  buoyancy  fluxes,  which  create  layers, 
exceed  down-gradient  turbulent  buoyancy  fluxes,  which 
disrupt  layers.  If  we  had  a  solid  conception  of  how  DD 
signatures  are  formed,  it  might  be  feasible  to  test  this 
notion  by  comparing  spatial  patterns  of  DD  signatures 
with  those  of  mixing  rates.  Unfortunately,  we  cannot 
claim  certainty  about  how  DD  signatures  are  formed 
(see  Q2  and  Q3  below),  nor  can  we  map  easily  the  spa¬ 
tial  patterns  of  mixing  rates.  Even  so,  it  might  be  useful 
to  map  DD  occurrence  patterns,  in  order  to  guide  efforts 
to  understand  the  DD  processes. 

For  example,  it  has  been  noted  that  SF  staircases 
are  mainly  seen  when  Rp  is  less  than  about  2.  Why  is 
this  so,  and  what  sets  the  critical  value  of  Rp?  Several 
answers  have  been  put  forward.  Most  of  these  relate  to 
competition,  since  instability  theory  suggests  that  SF 
should  be  possible  up  to  Rp  =  k/ks  ~  100.  For  ex¬ 
ample,  perhaps  the  SF  growth  rate  needs  to  exceed  the 
buoyancy  frequency  N,  if  the  latter  sets  a  timescale  for 
disruption.  Or  perhaps  it  needs  to  exceed  the  large- 
scale  shear  dUjdz,  if  SF  tilting  is  the  main  issue.  Or 
perhaps  it  needs  to  exceed  the  Coriolis  parameter  /,  if 
inertial  turning  sets  a  limit.  Other  possibilities  could  be 
mentioned.  While  laboratory,  theoretical  and  numeri¬ 
cal  work  on  each  possibility  would  be  welcome,  efforts 
could  be  more  focused  if  field  studies  were  undertaken  to 
hint  at  which  physical  effects  are  most  relevant.  For  ex¬ 
ample,  consider  the  coriolis  parameter.  Schmitt  (1994) 
points  out  that  the  Kunze  (1990)  model  of  disruption  by 
inertial  waves  implies  that  the  observed  maximimi  Rp 
for  regular-shaped  SF  staircases  should  decrease  with 
increasing  latitude,  and  that  this  qualitative  pattern 
seems  to  be  hold  in  the  ocean.  Does  this  provide  firm 
support  for  the  hypothesis  of  inertial-turning  limitation, 
or  is  it  a  coincidence? 

The  DC  case  provides  a  marked  contrast  to  the  SF 
case,  since  regular-shaped  staircases  are  routinely  ob¬ 
served  for  (DC-formulated)  Rp  values  ranging  up  to  at 
least  10.  Is  the  nature  of  disruption  different  in  the  two 
cases,  e.p.  with  shear  inhibiting  SF  fluxes  {Kunze,  1994) 


but  not  DC  fluxes  {P adman,  1994)?  Or  is  it  just  that 
DC  survives  to  higher  density  ratios  because  disruption 
is  weaker  in  regions  where  DC-unstable  stratification 
exists,  such  as  the  Arctic? 

Interleaving.  Predicting  regions  in  which  DD  inter¬ 
leaving  might  occur  is  more  difficult  than  doing  so  for 
staircases.  Interleaving  has  been  observed  near  thermo¬ 
haline  fronts'*  in  many  regions  of  the  world  ocean  {May, 
1999).  A  dramatic  example  is  provided  by  the  Arctic, 
which  has  interleaving  signatures  with  remarkable  spa¬ 
tial  and  temporal  coherence  {Rudels  et  al,  1999).  To 
date,  sampling  of  the  interleaving  mode  has  been  very 
limited.  Extensive  field  programs  are  needed  to  develop 
a  clearer  picture  of  interleaving.  It  is  important  that 
such  sampling  be  on  a  grid,  not  on  isolated  transects, 
since  only  then  can  the  along-front  and  across-front 
slopes  of  intrusions  be  measured.  These  slopes  are  key 
dynamical  indicators  that  might  help  us  to  select  from 
competing  theories,  and  select  we  must.  For  example,  it 
is  still  unclear  (see  Q3  below)  whether  Arctic  interleav¬ 
ing  results  from  DD  processes  {May  and  Kelley,  2001) 
or  from  differential  mixing  {Hebert,  1999;  Merryfield, 
2001),  and  surely  that  is  a  first-order  question! 

Q2.  What  creates  staircases? 

Collective-instability  mode.  An  early  hypothesis  for 
staircase  formation  was  a  collective-instability  mode,  in 
which  the  SF  set  up  internal  waves  that  in  turn  disrupt 
the  SF,  yielding  a  system  of  sheets  and  layers  {Stem, 
1969;  Stem  and  Turner,  1969).  Questions  remain  as 
to  the  relevance  of  the  mechanism  as  a  general  cause 
of  staircase  formation.  One  is  whether  the  idea  can  be 
reformulated  for  the  DC  case^.  Others  relate  to  the 
details  of  the  proposed  mechanism  limiting  SF  length 
{Kunze,  1987,  1990,  1994). 

Variable-difFusivity  mode.  The  vertical  mode  of  DD 
extracts  potential  energy  from  the  gravitationally  desta¬ 
bilizing  component  of  density,  transporting  buoyancy  in 
the  up-gradient  direction.  Associated  flux  divergences 
have  been  hypothesized  as  a  cause  of  staircase  forma¬ 
tion  {Ruddick,  1997),  by  analogy  to  an  hypothesis  for 
the  creation  of  steppiness  in  DD-stable  fluids  by  flux 
divergences  arising  from  turbulent  diffiisivities  that  de¬ 
pend  on  the  buoyancy  gradient  {Phillips,  1972;  Ruddick 
et  al,  1989).  Preliminary  tests  along  these  lines  have 
been  carried  out  via  ID  numerical  simulations  {Merry- 
field,  2000)  but  questions  remain  about  flux  parameter- 
izations,  boundary  conditions,  etc.  Indeed,  the  impor- 

thermohaline  front  is  taken  here  to  mean  a  front  with  co¬ 
varying  5  and  T  fields,  but  not  necessarily  with  flat  isopycnals. 
See  Figure  2, 

®  However,  the  search  for  parallelism  between  the  SF  and  DC 
cases  owes  as  much  to  aesthetic  desire  as  to  physical  principle. 
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tance  of  boundary  conditions  is  difficult  to  overstate: 
direct  numerical  simulations  of  the  SF  case  show  layer 
formation  with  insulating  top/bottom  boundary  con¬ 
ditions  {Ozgokmen  et  al,  1998)  but  not  with  periodic 
boimdary  conditions  (Merry field  and  Grinder,  2001). 
Shear-modulated  mode.  The  inhibition  of  SF  fluxes 
by  shear  (Linden,  1971;  Kunze,  1990,  1994)  could  lead 
to  vertical  variations  of  buoyancy  flux  associated  with 
vertical  variations  in  shear,  and  this  has  been  hypothe¬ 
sized  as  a  mechanism  for  the  generation  of  flne-structure 
in  SF-unstable  regions  ( Wells  et  al,  2001).  A  challenge 
in  taking  this  to  provide  a  full  explanation  of  staircase 
formation  is  the  requirement  that  the  spatial  pattern  of 
shear  match  that  of  observed  staircases.  Whether  that 
holds  or  not,  this  process  could  play  a  collaborative  role 
in  other  mechanisms. 

Applied-flux  mode.  Turner  (1968a)  proposed  a 
mechanism  for  the  creation  of  thermohaline  staircases 
by  the  application  of  destabilizing  buoyancy  flux,  e.g. 
when  a  salt  gradient  is  heated  from  below.  Further  lab¬ 
oratory  and  theoretical  treatments  have  added  details 
to  Turner’s  initial  sketch  of  this  mechanism  (Linden, 
1976;  Huppert  and  Linden,  1979;  Fernando,  1987)  and 
2D  numerical  simulations  have  added  color  to  the  pic¬ 
ture  (Molemaker  and  Dijkstra,  1997).  However,  ques¬ 
tions  remain  about  the  directness  of  the  analogy  of  the 
applied-flux  scenario  to  the  ocean,  where  staircases  ap¬ 
pear  at  mid-depth  and  where  fluxes  are  likely  not  to 
be  constant,  but  rather  to  depend  on  the  DD  response 
itself. 

Modifled-intrusion  mode.  The  ideas  outlined  above 
are  a  generation  old,  but  new  ideas  are  now  starting 
to  surface.  A  prime  example  is  the  Merryfield  (2000) 
proposal  that  staircases  might  result  from  intrusions. 
The  theory  produces  reasonable  predictions  of  oceanic 
staircase  observations.  Issues  remaining  to  be  resolved 
include  the  role  of  baroclinicity  (Kuzmina  and  Rodi¬ 
onov,  1992;  May  and  Kelley,  1997,  2001)  and,  as  usual, 
the  fundamental  imcertainty  about  how  to  parameter¬ 
ize  DD  fluxes  (see  Q5  below).  Another  issue  relates  to 
context:  what  sets  off  the  initial  interleaving?  In  some 
cases  we  may  answer  that  the  interleaving  results  from 
the  contact  of  watermasses  of  different  TS  character¬ 
istics.  However,  in  other  cases  (e.g.  perhaps  in  deep 
Arctic  basins)  there  may  not  be  a  great  deal  of  lateral 
variation  in  water  properties  to  set  off  interleaving. 

Q3.  What  creates  intrusions? 

Neighboring-watermass  mode.  It  is  common  to 
observe  interleaving  across  fronts  separating  WS  and 
CF  watermasses.  Stem  (1967)  presented  an  instabihty 
theory  for  this  process  that  has  since  been  extended 
greatly,  e.g.  allowing  for  friction  as  well  as  diffusion 
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Figure  2.  Background  gradients  in  the  Stem  (1967)  in¬ 
stability  theory  of  double-diffusive  interleaving  across  a 
barotropic  thermohaline  front,  for  background  gradients  of 
the  SF  (left)  and  DC  senses  (right).  The  dashed  lines  illus¬ 
trate  that  density  contributions  from  S  and  T  are  assumed 
to  compensate  laterally,  yielding  flat  isopycnals.  Is  this  a 
good  model  of  ocean  fronts? 


(Toole  and  Georgi,  1981),  allowing  for  more  general 
frontal  geometry  (Niino,  1986),  allowing  for  baroclin¬ 
icity  (Kuzmina  and  Rodionov,  1992;  May  and  Kelley, 
1997),  etc.  The  “allowing  for”  phrases  in  the  last  sen¬ 
tence  relate  to  some  of  the  most  basic  aspects  of  ocean 
physics,  and  this  might  suggest  that  this  theory  has 
not  borne  its  last  fruit  yet.  If  we  think  ocean  fronts 
might  be  misteady  on  the  timescale  of  interleaving,  if 
we  think  cross-frontal  contrasts  might  vary  with  depth, 
if  we  think  fronts  vary  in  the  downstream  direction,  if  we 
think  diffusivity-based  flux  laws  are  flawed,  if  we  think 
SF  and  DC  fluxes  could  act  at  the  same  time,  . . . ,  then 
we  may  not  be  surprised  to  see  more  extensions  of  the 
Stem  (1967)  idea.  Extending  such  analytical  models  of 
initial  growth  to  the  stage  of  finite-amplitude  evolution 
will  remain  a  challenge.  Probably  laboratory  work  will 
be  crucial  in  guiding  thinking,  as  it  has  been  histori¬ 
cally  (Ruddick  and  Turner,  1979;  Ruddick  et  al,  1998). 
Intermediate-scale  numerical  simulations  may  play  an 
increasing  role  in  developing  understanding,  but  imtil 
we  are  more  certain  how  to  parameterize  DD  fluxes  (see 
Q5  below)  a  cloud  will  hang  over  such  work,  as  it  does 
now  over  theoretical  treatments. 

Sloped-boundary  Mode.  Sloped  insulating  bound¬ 
aries  have  been  shown  to  create  interleaving  structures 
even  in  fluids  with  no  initial  horizontal  variations  in 
water  properties  (Turner,  1973;  Linden  and  Weber, 
1977).  This  scenario  deserves  more  study,  because  slop¬ 
ing  boundaries  are  common  in  the  ocean,  and  thermo¬ 
haline  currents  are  often  steered  along  them.  Might  the 
two  mechanisms,  the  watermass  and  boundary  modes, 
be  linked?  A  good  place  to  investigate  this  question 
might  be  the  Arctic,  where  interleaving  is  observed 
near  currents  of  WS  Atlantic  waters  that  appear  to  be 
steered  along  mid-ocean  ridges  (Rudels  et  al,  1999). 
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Differential-mixing  mode.  Incomplete  turbulent 
mixing  can  yield  differential  mixing  rates  for  heat  and 
salt,  owing  to  the  difference  between  the  molecular  dif- 
fusivities  of  heat  and  salt  acting  on  fluid  parcels  momen¬ 
tarily  put  into  contact  by  the  weak  turbulence  ( Turner ^ 
1968b;  Altman  and  Gargett,  1987;  Ruddick,  1997).  It  is 
not  yet  clear  how  to  parameterize  these  mixing  rates. 
The  2D  direct  numerical  simulations  of  Merry  field  et  al 
(1998)  confirm  the  expected,  i.e.  that  heat  diffuses 
faster  than  salt  and  that  the  effect  vanishes  if  mixing  is 
vigorous.  Soon,  we  may  have  flux  laws  that  have  been 
inferred  from  laboratory  work  and  3D  direct  numer¬ 
ical  simulations  being  done  today.  In  the  meantime, 
it  is  worth  noting  that  the  idea  of  differential  mixing 
may  hold  promise  in  answering  a  long-standing  ques¬ 
tion  in  intrusion  research,  namely  how  to  explain  intru¬ 
sions  seen  in  locations  that  have  DD-stable  background 
vertical  gradients.  The  conventional  explanations  are 
that  (a)  the  initial  lateral  displacements  were  not  in¬ 
finitesimal,  as  in  the  theories,  but  were  large  enough 
to  create  inversions,  (b)  we  are  observing  the  “noses” 
of  intrusions  extending  from  DD-\mstable  regions  into 
DD-stable  regions,  or  (c)  the  intrusions  are  fossilized 
signatures  in  a  background  field  that  was  previously 
DD-unstable.  The  idea  of  differential  mixing  provides 
a  new  hypothesis:  that  the  differential  mixing  of  5  and 
T  could  yield  density  convergence  analogous  to  the  DD 
case,  thus  driving  intrusions.  This  proposal  has  been 
put  forward  recently  in  a  general  context  by  Hebert 
(1999)  and  for  the  particular  case  of  Arctic  intrusions  by 
Merry  field  (2001).  The  former  author  points  out  that 
testing  the  scenario  is  problematic  in  terms  of  tests  in 
the  field,  since  a  key  diagnostic  is  the  cross-frontal  in¬ 
trusion  slope,  which  is  difficult  to  measure. 

Q4.  Do  staircases  and  intrusions  interact? 

Mixed  modes.  See  the  discussion  of  Q2  for  issues 
relating  to  the  Merry  field  (2000)  idea  of  transformation 
of  intrusions  into  staircases. 

Intrusions  within  staircases.  Although  some  stair¬ 
cases  display  remarkable  integrity  in  some  respects  (e.^. 
trends  in  layer  TS  properties),  they  are  certainly  not 
one-dimensional  structures  without  lateral  variation. 
This  is  revealed  by  high-resolution  sampling  (which  is, 
unfortunately,  rare).  For  example,  Padman  and  Dillon 
(1988)  found  that  station  spacings  of  less  than  about 
1  km  were  required  to  track  individual  layers  in  the 
DC  staircase  that  they  measured  in  the  Arctic  Canada 
Basin  with  microstructure  temperature  profiles.  Simi¬ 
larly,  towed-chain  thermistor  sampling  of  the  SF  stair¬ 
case  of  the  C-SALT  experiment  revealed  rich  variabil¬ 
ity  on  several  scales  and  of  several  physical  types  {Mar- 
morino^  1989, 1991).  For  example,  within  this  SF  stair¬ 


case,  Marmorino  (1991)  sees  evidence  of  the  expected 
convection  plumes  within  the  layers,  but  also  signs  of 
DC  interfaces  and  of  intrusions  within  the  layers.  These 
intrusions  extend  about  1  km  laterally,  and  since  they 
are  found  in  the  middle  of  the  wide  staircase  zone,  they 
seem  not  to  have  entered  from  the  edges.  What  causes 
them?  Marmorino  (1991)  speculates  that  they  may 
arise  because  of  lateral  variations  in  DD  vertical  buoy¬ 
ancy  fluxes  or  as  a  response  to  mesoscale  stirring.  These 
are  an  important  issues  to  clarify.  If  lateral  phenomena 
such  as  intrusions  control  interface  substructures,  and 
if  these  substructures  control  DD  fluxes,  then  we  won’t 
be  able  to  parameterize  DD  fluxes  in  terms  of  large- 
scale  properties  until  we  can  come  to  grips  with  lateral 
affects.  Further  tests  in  the  field,  and  in  the  laboratory, 
are  sorely  needed. 

Staircases  within  intrusions.  High-resolution  sam¬ 
pling  sometimes  reveals  staircases,  or  at  least  steppy 
profiles,  between  interleaving  intrusions  (see  e.g.  Perkin 
and  Lewis  (1984)  Figure  lib).  These  have  received  sur¬ 
prisingly  little  attention  to  date,  probably  because  sam¬ 
pling  has  been  so  sparse.  Until  fuller  imderstanding  is 
developed,  we  should  view  analyses  based  on  presumed 
DD  flux  laws  between  interleaves  as  being  somewhat 
suspect.  The  issue  may  not  be  easy  to  resolve,  since  in¬ 
terleaving  environments  tend  to  be  more  dynamic  than 
staircase  environments,  and  we  haven’t  come  to  grips 
with  the  latter  yet. 

Q5.  Can  DD  fluxes  be  parameterized? 

Necessity.  While  diagnostic  calculations  for  a  given 
ocean  region  can  be  made  using  DD  fluxes  calculated  by 
direct  {e.g.  microscale)  measurements,  prognostic  calcu¬ 
lations  require  a  parameterization  of  DD  fluxes  in  terms 
of  large-scale  properties. 

Form  of  flux  law.  For  the  DC  case  it  seems  likely  that 
layer-layer  flux  laws  (in  which  fluxes  are  presumed  to  be 
determined  by  the  contrasts  A5  and  AT  between  lay¬ 
ers)  are  valid  in  the  ocean  {Padman  and  Dillon,  1989; 
Padman,  1994).  However,  the  SF  case  is  apparently 
much  more  complicated®.  It  may  be  that  the  details 
of  the  SF  interfaces,  as  opposed  to  the  layer-layer  con¬ 
trasts,  are  important  in  setting  fluxes.  It  may  also  be 
that  external  factors,  such  as  large-scale  shear,  alter 
SF  convection  so  much  that  they  must  be  taken  into 
account  in  trying  to  formulate  a  large-scale  diffusivity 
{Kunze,  1994).  Since  I  think  we  are  closer  to  formulat¬ 
ing  large-scale  parameterizations  for  the  DC  case  than 
the  SF  case,  I’ll  concentrate  on  the  DC  case  here. 
Exponent  in  layer-layer  flux  law.  Early  labora¬ 
tory  work  with  sharp  interfaces  between  well-mixed 


®Or  has  the  SF  case  just  been  better  studied? 
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Figure  3.  Failure  of  the  4/3  flux  law  for  thermal  convection 
in  water.  Dots  indicate  mesisurements  from  five  laboratory 
studies,  compiled  by  Kelley  (1990),  The  4/3  flux  law  pre¬ 
dicts  Nu  oc  and  the  lower  dots  show  this  is  not  true; 

an  exponent  of  0.28  matches  the  measurements  better. 


Figure  4.  Eflfective  large-scale  thermal  diffusivity  for  DC 
case.  The  open  dots  are  from  Kelley  (1984),  the  crosses 
from  Fedorov  (1988),  and  the  boxes  are  inferred  from  the 
G*  values  graphed  by  Padman  and  Dillon  (1987). 


layers  suggested  that  vertical  fluxes  are  proportional 
to  or  for  the  SF  or  DC  cases  respec¬ 

tively,  with  a  proportionality  factor  C  =  C{Rp).  This 
4/3  exponent  was  based  on  a  dimensional  analysis  of 
single-component  convection  {Turner^  1965).  However, 
measurements  of  single-component  convection  contra- 
diet  the  prediction  (Figure  3)  ,  yielding  an  exponent 
nearer  5/4  than  4/3  {Kelley^  1990).  A  lower  exponent 
is  also  predicted  by  convection  theories  {Castaing  et  al, 
1989;  Kelley^  1990)  and  by  direct  numerical  simulations 
of  thermal  convection  {Kerr,  1996).  Does  the  same  ap¬ 
ply  to  DD  convection?  I  am  unaware  of  laboratory  tests 
in  the  DC  case,  but  in  the  SF  case,  the  laboratory  tests 
are  somewhat  contradictory.  Schmitt  (1979)  reported 
support  for  an  exponent  of  4/3,  with  regression-based 
exponents  in  his  Table  3  ranging  from  1.24  to  1.37.  Mc- 
Dougall  and  Taylor  (1984)  reported  that  an  exponent 
of  1.23  matched  their  observations  better  than  4/3,  but 
that  distinguishing  between  the  two  values  was  prob¬ 
lematic  with  their  measiirements.  Taldng  these  things 
together,  the  value  of  the  exponent  must  be  regarded  as 
an  open  question.  One  might  also  ask  whether  some  of 
the  scatter  in  the  empirical  value  of  C{Rp)  might  result 
from  the  extrapolation  errors  resulting  from  using  an 
incorrect  exponent  {Kelley,  1990). 

Linking  large-scale  properties  to  vertical  fluxes. 
If  fluxes  are  governed  by  layer-layer  flux  laws,  as  they 
appear  to  be  in  the  DC  case,  then  parameterizing  fluxes 
is  equivalent  to  parameterizing  the  thickness  of  layers 
within  staircases,  since  layer  thickness  together  with 
large-scale  gradients  yields  the  A5  and  AT  values  re¬ 
quired  in  order  to  calculate  fluxes.  This  idea  is  the  gist 
of  a  proposed  parameterization  of  large-scale  diflFusivi- 
ties  for  the  DC  case  {Kelley,  1984,  1988),  illustrated  in 
Figure  4  here.  The  original  measurements  are  shown  in 
this  figure  along  with  the  closely  matching  values  of  Fe¬ 


dorov  (1988).  However,  the  values  reported  by  Padman 
and  Dillon  (1987)  are  in  systematic  disagreement,  yield¬ 
ing  a  reduction  in  the  large-scale  diffusivity  by  a  factor 
of  approximately  3.  The  reason  for  this  discrepancy  is 
unknown,  and  more  observations  would  help  to  clar¬ 
ify  whether  the  layer-thickness  (and  diffusivity)  scaling 
presented  by  Kelley  (1984)  is  generally  valid.  Along 
similar  lines,  it  would  help  to  examine  the  pattern  of 
variation  of  layer  thickness,  looking  for  the  “split”  lay¬ 
ers  proposed  by  Kelley  (1988)  to  be  a  signature  of  a 
process  that  controls  layer  thickness. 

Interleaving  fluxes.  It  is  not  clear  how  to  parameter¬ 
ize  interleaving  fluxes  since  the  dynamics  are  still  not 
understood.  Even  energy-based  arguments  seem  ten¬ 
uous,  given  that  the  energy  flow  depends  on  whether 
the  physics  involves  DD,  differential  mixing,  baroclinic 
exchange,  etc.  As  the  C-SALT  experiment  set  a  firm 
foundation  for  analysis  of  the  SF  staircase  mode,  so 
might  a  dedicated  field  study  enliven  research  on  inter¬ 
leaving.  In  the  meantime,  gross  sensitivity  studies  in  a 
GCM  would  be  welcome, 

Q6.  Is  DD  important? 

Locally.  In  regions  with  regular-shaped  staircase  sig¬ 
natures,  it  seems  reasonable  to  conclude  that  DD  is  im¬ 
portant  compared  with  other  forms  of  mixing.  It  has 
been  argued  that  these  are  regions  with  weak  turbulent 
mixing  rates,  and  that  this  might  suggest  that  DD  is  not 
important.  However,  if  a  region  is  of  enough  interest  to 
foster  dynamical  study,  then  the  mixing  in  that  region 
must  also  be  of  interest,  whether  it  be  large  or  small. 
(The  Arctic  is  a  prime  example.)  And  what  of  regions 
that  lack  DD  signatures?  It  may  be  that  DD  is  signifi¬ 
cant  nonetheless.  For  example,  St.Laurent  and  Schmitt 
(1999)  suggest  that  at  the  site  of  the  NATRE  experi- 
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ment,  where  thermohaline  staircases  were  not  present, 
up  to  half  the  diffusion  of  an  injected  tracer  might  have 
been  transported  by  DD. 

Globally.  Does  DD  play  an  important  global  role,  say 
to  the  rate  of  overturning  circulation,  or  to  the  pole- 
ward  heat  flux,  two  quantities  of  great  interest  for  cli¬ 
mate  studies?  One  way  to  tackle  such  questions  is 
with  GCM  sensitivity  studies.  I  am  unaware  of  at¬ 
tempts  to  address  the  interleaving  mode  in  global  do¬ 
mains,  either  in  terms  of  isopycnal  fluxes  or  diapycnal 
fluxes  (the  latter  being  addressed  theoretically  by  Gar¬ 
rett  (1982)),  but  some  preliminary  studies  have  been 
done  of  the  staircase  mode.  So  far,  the  answer  seems  to 
be  divided.  Some  studies  suggest  a  large  importance, 
others  a  negligible  importance.  For  an  example,  the 
GCM  simulations  by  Zhang  et  al.  (1998)  and  Merry  field 
et  al.  (1999),  using  similar  DD  flux  parameterizations, 
yielded  very  different  results.  The  first  study  found  that 
including  DD  mixing  reduces  the  overturning  circula¬ 
tion  by  22  percent,  while  the  second  study  found  only 
about  1  percent.  Why  do  these  results  differ  so  greatly? 
Stating  differences  between  the  model  configurations  is 
straightforward.  The  former  focused  on  a  single  basin, 
the  latter  on  the  globe;  the  former  used  square  walls 
and  zonally-averaged  surface  forcing,  the  latter  used  re¬ 
alistic  geography  and  forcing,  etc.  However,  explain¬ 
ing  the  differences  in  results  is  not  so  straightforward, 
and  may  justify  further  study.  Models  of  intermedi¬ 
ate  complexity  and  type  might  reveal  why  the  Zhang 
et  al  (1998)  and  Merryfield  et  al  (1999)  results  dif¬ 
fer  so  much.  As  is  usual  in  ocean  models,  the  form  of 
the  surface  boundary  conditions  may  be  crucial.  A  fol¬ 
lowup  to  the  Zhang  et  al  (1998)  study,  which  employed 
mixed  surface  boundary  conditions  instead  of  the  re¬ 
laxation  conditions  used  by  Zhang  et  al  (1998),  found 
that  DD  had  very  little  affect  on  the  overturning  circu¬ 
lation  {Zhang  and  Schmitt,  2000).  On  the  other  hand, 
it  revealed  a  heightened  sensitivity  of  the  circulation 
stability  to  freshwater  forcing.  Until  the  contradictory 
results  of  such  coarse-resolution  GCM  studies  are  better 
understood,  the  importance  of  DD  fluxes  to  the  global 
overturning  circulation  rate  remains  less  than  certain. 

Answers 

Few  of  the  questions  listed  above  were  unposed  thirty 
years  ago.  When  will  we  have  answers,  and  how  will  we 
get  them?  It  seems  clear  that  the  answers  to  some  of  the 
small-scale  questions  (e.^.  how  do  salt  fingers  react  to 
shear?)  may  soon  be  provided  by  direct  numerical  sim¬ 
ulations.  A  new  era  of  oceanographically  relevant  direct 
numerical  simulations  (which  I  think  is  imminent;  see 
appendix)  may  free  us  from  the  uncomfortable  posture 
of  straddling  oversimplified  theories  and  richly-complex 


laboratory  simulations.  However,  it  will  be  a  long  time 
before  direct  numerical  simulations  will  have  the  scope 
to  match  laboratory  (meter)  scales,  let  alone  oceanic 
scales  that  are  orders  of  magnitude  larger.  Indeed,  if 
history  is  any  guide,  future  theoretical  and  laboratory 
work,  as  well  as  direct  numerical  simulations,  will  need 
field  experiments  to  provide  ground  truth  and  also  to 
suggest  relevant  problems  to  study.  Until  we  learn  how 
double-diffusive  structures  are  formed  in  the  ocean,  and 
how  double  diffusion  interacts  with  its  competitors,  we 
cannot  assess  the  significance  of  double  diffusion  in  the 
vertical  mixing  of  the  sea,  fulfilling  the  goal  that  Stem 
(1960)  stated  in  such  sanguine  words,  so  long  ago. 

Appendix:  A  computational  laboratory? 

A  3D  numerical  simulation  on  an  iV  x  iV  x  TV*  grid 
requires  AN^  numerical  operations  per  timestep,  where 
A  ~  10^  depends  on  the  coding  and  computer  ar¬ 
chitecture^.  Thus,  a  computer  that  performs  F  op¬ 
erations  per  second  can  do  a  realtime  simulation  (di¬ 
rectly  competitive  with  a  laboratory  study)  only  if 
N  <  {FIYt/A)^^^,  where  At  is  the  model  timestep. 

Under  laboratory  conditions,  the  width  of  salt  fin¬ 
gers  is  ~  3  X  lO-^m.  If  ^k/ks  10  gridpoints  are 
required  for  adequate  resolution,  a  reasonable  compu¬ 
tational  mesh  might  have  resolution  Ax  ~  3  x  10  “^m 
and  a  diffusive-limit  timestep  At  ~  Ax^/i/  10“^s. 
A  desktop  computer  can  perform  F  10®  operations 
per  second,  setting  a  realtime  limit  of  AT  ~  20,  i.e.  a 
domain  that  can  hold  under  a  dozen  salt-fingers.  This 
suggests  that  a  desktop  computer  cannot  produce  use¬ 
ful  realtime  simulations.  Switching  to  a  supercomputer 
increases  F  by  a  factor  of  10®,  so  A/’  ~  200  and  thus  the 
domain  can  hold  hundreds  of  salt  fingers.  Such  simula¬ 
tions  may  be  of  great  utility,  but  even  they  span  only 
1/lOth  the  scale  of  typical  laboratory  experiments. 

What  if  the  realtime  constraint  is  relaxed?  Matching 
the  laboratory  domain  size  by  increasing  N  tenfold  in¬ 
creases  the  computational  requirement  by  10®;  a  year  of 
CPU  time  would  be  required  to  simulate  an  afternoon 
in  the  laboratory.  Thus,  it  seems  that  the  laboratory 
is  the  better  place  for  free-wheeling  investigation,  for 
exploring  parameter  space,  etc. 

When  will  this  change?  Assuming  that  computer 
power  continues  to  double  every  1.5  years  (Afann,  2000), 
within  a  decade  niunerical  simulations  running  at  1  /10th 
realtime  (arguably  a  practical  limit)  will  match  labora¬ 
tory  scales.  Thus,  we  may  soon  enter  a  new  era,  in 


^This  estimate  of  A  is  perhaps  accurate  to  an  order  of  magni¬ 
tude,  but  the  accuracy  is  of  little  interest  here.  The  main  point  is 
the  cubic  dependence  of  computational  cost  on  N,  which  seems 
inarguable  for  models  based  on  grids  of  fixed  geometry. 
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which  numerical  simulation  is  a  common  adjunct  to,  or 
a  replacement  for,  laboratory  work. 

The  limiting  factor  may  be  software,  not  hardware. 
For  example,  a  typical  laboratory  setup  can  be  de¬ 
scribed  in  a  few  sentences  and  reproduced  both  broadly 
and  swiftly,  whereas  model  codes  are  not  easily  devel¬ 
oped,  reproduced,  or  modified.  Formalized  schemes  for 
sharing  code  might  prove  useful,  as  they  have  in  the 
open-source  community  {Raymond  and  Young,  2001). 
Funding  agencies  could  help  by  requiring  researchers  to 
use  open-source  development  practices,  perhaps  sharing 
codes  only  after  a  time  delay,  as  is  done  with  hydro- 
graphic  data. 
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Over  fifty  years  ago,  Carl  Eckart  described  the  sequence 
of  processes  as  a  passive  scalar  is  stirred  and  mixed  in  a 
turbulent  flow  {Eckart,  1948).  At  first,  during  the  stirring 
phase,  the  variance  of  the  scalar  gradient  is  greatly  in¬ 
creased,  but  later,  during  the  mixing  phase,  the  gradients 
become  sufficiently  sharp  that  molecular  diffusion  be¬ 
comes  important  and  the  gradient  variance  rapidly  de¬ 
creases.  The  process  is  of  great  practical  importance  in 
many  engineering  situations  as  well  as  being  familiar  from 
adding  cream  to  coffee. 

The  interplay  of  stirring  and  mixing  is  also  important  in 
the  ocean  for  temperature,  salinity,  chlorophyll  and  other 
naturally  occurring  tracers  as  well  as  for  introduced  mate¬ 
rial.  The  situation  is  complicated  by  the  numerous  physical 
processes  that  can  cause  stirring  and  by  the  interplay  of 
processes  that  act  along  isopycnals  (surfaces  of  constant 
mean  density,  where  “mean”  requires  careful  definition) 
with  “diapycnal”  processes  acting  across  the  density  sur¬ 
faces. 

The  Twelfth  ‘Aha  Huliko’a  Hawaiian  Winter  Workshop 
was  convened  from  15  to  19  January  2001  to  review  this 
theme,  with  the  support  of  the  Office  of  Naval  Research 
and  the  participation  of  25  invited  speakers.  The  general 
topics  included  in  presentations  and  discussions  were  (1) 
mesoscale  eddy  parameterization,  (2)  general  approaches 
to  stirring,  including  the  application  of  ideas  from  dynami¬ 
cal  systems  theory,  (3)  inertial  instability,  submesoscale 
motions  and  vortical  motions,  (4)  the  interplay  of  isopyc- 
nal  and  diapycnal  processes,  (5)  processes  in  the  surface 
mixed  layer,  (6)  the  stirring  and  mixing  of  active,  particu¬ 
larly  biological,  tracers,  (7)  mixing  efficiency,  i.e.  the  frac¬ 
tion  of  energy  lost  from  the  mean  flow  which  produces  a 
vertical  buoyancy  flux,  and  (8)  differential  mixing  of  heat 
and  salt  in  doubly-stable  environments  as  well  as  those 
which  permit  double  diffusion.  Before  summarizing  the 
discussions  on  these  topics,  we  first  review  the  general 
approach  to  parameterizing  unresolved  processes. 

The  need  for  parameterization 

Ocean  general  circulation  models  used  in  climate  stud¬ 
ies  lack  the  resolution  to  treat  small-scale  processes  explic¬ 
itly  and  so  must  parameterize  their  effects  in  terms  of  the 


average  properties  that  are  resolved.  If  the  eddy  flux  of  a 
scalar  is  related  by  a  tensor  to  the  local  gradient  of  the 
mean  concentration  of  the  scalar,  as  for  a  mixing  length 
argument,  the  symmetric  part  of  the  tensor  is  diagonaliz- 
able  and  may  be  thought  of  as  anisotropic  diffusion.  The 
general  assumption  is  that  two  principal  directions  are 
along  the  “neutral”  surfaces  of  constant  mean  potential 
density,  with  very  much  greater  diffusivity  in  these  direc¬ 
tions  than  in  the  direction  normal  to  the  surfaces.  The 
isopycnal  diffusion  along  the  neutral  surfaces  is  largely 
associated  with  mesoscale  eddies,  whereas  the  diapycnal 
mixing  across  them  is  caused  by  breaking  internal  waves 
and,  in  some  locations,  double-diffusive  processes. 

The  antisymmetric  part  of  the  tensor  describes  a  “skew 
flux”  normal  to  the  mean  gradient  of  the  scalar.  Part  of  this 
flux  is  non-divergent  and  does  not  affect  the  mean  scalar 
concentration.  The  remaining,  divergent,  part  may  be  ex¬ 
pressed  as  simple  advection  of  the  scalar  by  an  additional 
mean  flow,  the  “skew  velocity”,  over  and  above  that  which 
would  be  recorded  by  current  meters  at  a  fixed  location.  To 
some  extent  the  skew  flux  can  be  thought  of  as  represent¬ 
ing  an  eddy  flux  of  thickness,  or  spacing  between 
neighbouring  isopycnals,  but  it  is  actually  a  more  subtle 
representation,  in  a  coordinate  system  using  z  as  the  verti¬ 
cal  coordinate,  of  a  flux  that  would  appear  in  a  coordinate 
system  using  the  potential  density  instead. 

These  results  are  purely  kinematic.  Specific  numerical 
values  for  the  diffusivities  and  skew  velocities  could  be 
obtained  from  suitable  measurements  in  the  present  ocean. 
Any  model  that  aspires  to  predictive  capability,  however, 
requires  an  understanding  of  the  dynamical  processes  re¬ 
sponsible  for  the  transports  to  which  the  model  output  is 
sensitive.  The  parameterizations  must  then  be  specified  as 
formulae  rather  than  just  numerical  values.  Consider 
diapycnal  mixing  by  internal  wave  breaking.  If  it  is  to  be 
characterized  not  by  a  number  but  by  a  formula  then  the 
origin  of  the  wave  field  becomes  important.  Vastly  differ¬ 
ent  feedback  loops  can  be  obtained  in  climate  studies  de¬ 
pending  on  whether  the  intensity  of  the  wave  field  and 
hence  diapycnal  mixing  is  determined  by  the  ocean  circu¬ 
lation,  cascading  energy  via  eddies  to  the  waves,  or  by  the 
atmospheric  wind  field  generating  waves  at  the  sea  sur¬ 
face,  or  by  the  surface  tide  generating  waves  at  the  sea 
floor. 
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The  need  to  parameterize  subgrid  processes  is  not  idio¬ 
syncratic  to  oceanic  general  circulation  models  but  generic 
to  all  models.  There  are  always  unresolved  processes. 

Mesoscale  eddies 

So-called  mesoscale  eddies,  with  a  spatial  scale  of  the 
order  of  100  km  and  a  time  scale  of  the  order  of  a  month, 
are  a  dominant  feature  of  ocean  variability.  They  are  dy¬ 
namically  analogous  to  synoptic-scale  eddies  in  the  atmos¬ 
phere.  The  deformation  of  an  air  layer  by  such  eddies  is 
depicted  in  Pierre  Welander’s  (1955)  now  famous  check¬ 
erboard  figure,  here  reproduced  as  Figure  1.  The  figure 
also  depicts  the  stirring  along  oceanic  isopycnals  if  time 
and  space  are  appropriately  rescaled. 


Figure  1.  Stirring  without  mixing  of  passive  tracer.  The  initial 
streamlines  of  the  two-dimensional  turbulent  flow  are  shown  on 
top.  The  flow  deforms  the  checkerboard  pattern  into  elongated 
filaments.  In  the  atmospheric  case,  the  length  of  the  checkerboard 
is  1200  km  and  the  deformation  is  shown  after  6,  12,  24  and  36  h. 
In  the  oceanic  case,  the  length  of  the  checkerboard  is  100  km  and 
the  deformation  is  shown  after  3,  6,  12  and  18  d.  (After  We- 
lander,  1955). 

Determining  the  magnitudes  of  the  isopycnal  diffusivity 
and  skew  velocity  associated  with  oceanic  mesoscale  ed¬ 
dies  re(]uires  an  understanding  derived  from  a  combination 
of  general  principles,  dynamical  studies  of  processes  such 
as  baroclinic  instability,  and  numerical  models  that  resolve 
the  eddies  in  various  idealized  situations.  There  are  no 
simple  answers.  Invoking  general  principles,  such  as  mix¬ 
ing  potential  vorticity,  can  imply  unphysical  sources  of 


momentum  and  energy;  eddy  parameterizations  should  not 
do  what  eddies  cannot  do!  To  some  extent  the  eddies 
serve  to  release  available  potential  energy  from  the  mean 
state,  as  in  the  popular  scheme  of  Gent  and  McWilliams 
(1990),  but  this  will  lead  to  an  ocean  at  rest  in  a  situation, 
without  forcing  or  dissipation,  where  a  steady,  stable,  cir¬ 
culation  is  more  likely.  Moreover,  idealized  eddy¬ 
resolving  runs  have  mostly  been  used  to  produce  numeri¬ 
cal  values  for  the  free  parameter  in  the  Gent  and 
McWilliams  scheme,  rather  than  the  formula  in  terms  of 
resolved  variables  which  we  really  need  for  predictive 
models. 

The  extant  parameterizations  of  mesoscale  eddies  really 
just  treat  the  stirring  effect  of  the  eddies.  One  unanswered 
question  is  whether  this  stirring  connects  directly  to  mo¬ 
lecular  mixing  at  very  fine  scales,  or  whether  there  is  an 
intermediate  mechanism,  in  series  with  the  eddy  stirring, 
which  accomplishes  the  transfer  to  mixing.  Another  ques¬ 
tion  concerns  the  ultimate  fate  of  the  energy  released  from 
the  mean  state.  Parameterization  schemes  essentially  as¬ 
sume  that  this  is  dissipated  adiabatically,  but  this  may  not 
be  true.  We  return  to  both  these  issues  later. 

Another  very  serious  question  is  whether  the  effect  of 
mesoscale  eddies  can  be  represented  as  a  purely  local  ef¬ 
fect,  with  eddy  fluxes  just  proportional  to  local  mean  gra¬ 
dients.  This  may  be  reasonable  in  some  regions,  such  as 
the  Antarctic  Circumpolar  Current,  but  inappropriate  for 
non-zonal  flows  such  as  the  Gulf  Stream  from  which  radi¬ 
ating  Rossby  waves  and  coherent  rings  may  carry  proper¬ 
ties  over  large  enough  distances  to  invalidate  any  local 
mixing  length  approach.  In  such  situations  eddies  may 
drive,  rather  than  dissipate,  the  mean  flow. 

Ocean  models  may  remain  inadequate  until  mesoscale 
eddies  are  resolved  explicitly.  Quite  apart  from  the  diffi¬ 
culties  associated  with  mesoscale  eddies,  high  resolution 
also  seems  to  be  required  to  produce  a  sufficiently  high 
effective  Reynolds  number  for  the  correct  representation  of 
phenomena  such  as  the  separation,  variability,  and  penetra¬ 
tion  of  the  Gulf  Stream.  Although  results  stabilize  with 
increasing  resolution,  they  remain  sensitive  to  choices  in 
forcing  and  viscosity  parameterization,  even  at  the  finest 
grid  spacing  used  today. 

General  approaches  to  stirring 

Turbulent  stirring  leads  to  the  distortion  of  a  tracer  field. 
The  exact  distortion  is  determined  by  the  complete  La- 
grangian  evolution  of  the  flow  field,  but  much  can  be  in¬ 
ferred  from  the  velocity  gradient  tensor.  If  the  relative  mo¬ 
tion  near  a  point  is  dominated  by  vorticity,  then  tracer  gra¬ 
dients  tend  to  align  with  the  shear  and  not  increase.  If,  on 
the  other  hand,  the  rate  of  strain  dominates  the  relative 
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motion,  then  a  tracer  tends  to  become  elongated  along  the 
principal  axis  having  the  largest  rate  of  strain  and  the 
tracer  gradient  grows  exponentially. 

Simulations  of  two-dimensional  flows  show  additional 
phenomena  such  as  barriers  to  mixing  and  areas  of  fila¬ 
ment  production.  These  barriers  and  areas  can  be  identified 
when  additional  information  is  used.  In  one  approach,  the 
vorticity  (from  the  velocity  field)  is  combined  with  the 
rotation  of  the  rate  of  strain  tensor  (from  the  acceleration 
field)  into  an  effective  rotation  to  characterize  the  flow 
field.  Another  approach  uses  concepts  from  dynamical 
systems  theory.  It  identifies  hyperbolic  regions  in  the  flow 
field  (characterized  by  strong  deformations  in  the  velocity 
gradient  tensor)  to  calculate  special  material  curves,  or 
manifolds,  that  determine  the  stirring  and  mixing.  A  blob 
of  tracer  can  advect  without  much  distortion,  but  then  be 
ripped  apart  rather  abruptly  when  it  encounters  these  hy¬ 
perbolic  regions  in  a  flow.  Realistic  simulations  of  the  sur¬ 
face  currents  in  the  Gulf  of  Mexico  illustrate  this  process 
(Figure  2). 


Figure  2.  Evolution  of  a  blob  of  tracer  in  a  simulation  of  the 
surface  currents  in  the  Gulf  of  Mexico.  The  Loop  Cun  ent  is  lo¬ 
cated  in  the  lower  right  comer  and  a  cyclone  to  the  northwest. 
Height  anomalies  are  shown  as  dashed  lines.  The  red  and  blue 
lines  are  inflowing  and  outflowing  manifolds.  There  are  two  hy¬ 
perbolic  regions  associated  with  the  cyclone  and  the  Loop  Cur¬ 
rent.  On  day  196.75  a  circular  blob  (black)  straddles  an  inflowing 
manifold  (blue).  This  manifold  delineates  which  portion  of  the 
blob  will  travel  west  into  the  central  Gulf  and  which  portion  will 
travel  east  through  the  Straits  of  Florida.  The  blob  is  stirred  with 
modest  deformation  for  the  next  3.5  days  as  it  rotates  around  the 
cyclone.  On  day  200.25,  however,  the  blob  approaches  the  hy¬ 
perbolic  point  created  by  the  intersecting  inflowing  and  outflow¬ 
ing  (blue)  manifolds  to  the  southwest  of  the  cyclone.  Here  it  be¬ 
gins  to  show  strong  compression  along  the  inflowing  manifold 
direction  and  strong  distortion  along  the  outflowing  manifold 
direction.  Day  202.00  shows  the  fate  of  the  blob  when  it  reaches 
this  intersection.  It  has  collapsed  along  the  outflowing  manifold. 
(Courtesy  of  A.D.  Kirwan  and  L.  Kantha). 


Inertial  instability,  submesoscale  motions,  and 
vortical  motions 

Mesoscale  motions  may  arise  from  baroclinic  instability 
of  large-scale  flows  in  the  ocean  and  may,  in  turn,  be  un¬ 
stable  to  submesoscale  eddies  an  order  of  magnitude 
smaller.  Near  the  sea  surface  it  seems  that  these  eddies, 
often  visible  at  the  sea  surface  via  beautiful  spiral  patterns 
(Figure  3),  are  mainly  cyclonic,  quite  possibly  because 
anticyclonic  ones  would  break  down  rapidly  via  an  inertial, 
or  barotropic,  instability.  These  small  eddies  may  be  a  link 
in  the  pathway  from  stirring  to  mixing,  but  this  has  not 
been  quantified.  Similarly,  in  the  ocean  interior  there 
seems  to  be  a  family  of  “vortical  modes”,  of  largely  un¬ 
known  scale  range  and  intensity,  which  play  a  role  in  lat¬ 
eral  stirring.  They  can  be  generated  by  frictional  effects  in 
flow  past  topographic  features  and,  possibly,  by  the  col¬ 
lapse  and  geostrophic  adjustment  of  regions  mixed  by 
breaking  internal  waves. 


Figure  3.  A  pair  of  interconnected  spirals  in  the  Mediterranean 
Sea  south  of  Crete.  The  vortex  pair  has  a  clearly  visible  stagna¬ 
tion  point  between  the  two  spirals,  the  cores  of  which  are  aligned 
with  the  preconditioning  wind  field.  Picture  taken  on  October  7, 
1984.  (Courtesy  of  W.  Munk). 

A  general  question  about  the  links  between  low  fre¬ 
quency  eddies  and  higher  frequency  motions  concerns  the 
extent  to  which  the  mesoscale  or  submesoscale  eddies 
generate  internal  waves,  particularly  in  situations  where 
the  eddies  generate  sharp  fronts.  Theoretically,  the 
mesoscale  and  submesoscale  eddies  are  assumed  to  satisfy 
some  sort  of  balanced  dynamics,  such  as  geostrophy. 
These  balanced  motions  are  also  called  the  “slow  mani¬ 
fold”.  Balanced  turbulent  flows  exhibit  an  upscale  energy 
cascade  and  are  very  inefficient  in  dissipating  energy.  Inter¬ 
nal  gravity  waves  and  other  more  nearly  isotropic  flows,  on 
the  other  hand,  exhibit  a  downscale  cascade  with  effective 
dissipation.  The  structure  of  the  equations  governing  bal- 


202 


MULLER  AND  GARRETT 


anced  flows  is  such  that  they  can  be  integrated  forward  in 
time  only  if  certain  conditions  are  met.  Analysis  of  several 
classical  instability  problems  shows  that  the  fluid- 
dynamical  transition  occurs  in  the  neighborhood  of  these 
integrability  conditions.  It  is  hypothesized  that  violation  of 
these  conditions  characterizes  in  general  the  flow  regimes 
where  energy  is  transferred  from  balanced  to  unbalanced 
motions. 

The  interplay  of  isopycnal  and  diapycnal 
processes 

When  a  scalar  is  stirred  by  high  Reynolds  number  tur¬ 
bulence  in  homogeneous  water,  the  eddy  flux  is  mainly 
determined  by  the  turbulent  stirring  and  is  largely  inde¬ 
pendent  of  the  small  value  of  molecular  diffusivity.  Reduc¬ 
ing  the  latter  would  merely  lead  to  finer  scales  in  the  tracer 
field,  without  significantly  affecting  the  turbulent  trans¬ 
port,  The  situation  is  more  subtle  in  the  ocean  because  of 
the  need  to  quantify  both  isopycnal  and  diapycnal  trans¬ 
ports.  One  issue,  already  raised,  is  the  extent  to  which  en¬ 
ergy  released  by  mesoscale  eddies  is  dissipated  in  the 
ocean  interior,  perhaps  by  breaking  internal  waves  which 
have  drawn  some  of  that  energy  from  the  eddies.  This 
could  partly  tie  diapycnal  mixing  rates  to  the  lateral  stir¬ 
ring  rates. 

Another  issue  concerns  the  ultimate  mechanism  for  the 
destruction  of  fine-scale  lateral  gradients  of  temperature 
and  salinity  created  by  stirring  on  isopycnals.  One  possibil¬ 
ity  is  that  vertical  shear  of  the  stirring  process  leads  to  very 
strongly  tilted  frontal  regions,  so  that  pre-existing  diapy¬ 
cnal  mixing  can  remove  the  strong  gradients.  This  could  be 
described  as  a  “passive”  destruction  of  the  strong  lateral 
gradients.  A  more  active  process  is  possible:  thermohaline 
fronts  caused  by  lateral  stirring  on  isopycnals  could  be¬ 
come  unstable  to  a  double-diffusive  interleaving  process. 
This  would  introduce  further  diapycnal  mixing  at  the  same 
time  as  it  destroyed  lateral  gradients. 

The  relative  importance  of  these  scenarios  has  not  been 
established.  Preliminary  estimates  favor  the  latter  process, 
the  signature  of  which  should  be  internal  double-diffusive 
interleaving  regions  in  the  ocean.  These  are  certainly 
known  to  occur,  but  a  systematic  catalog  of  their  locations 
and  characteristics  has  not  been  developed.  If  the  process 
is  important  it  may  provide  significant  diapycnal  mixing  in 
regions  with  strong  lateral  gradients  of  potential  tempera¬ 
ture,  and  compensating  salinity,  on  isopycnals.  Moreover, 
the  diapycnal  mixing  is  upgradient  for  density,  as  is  neces¬ 
sary  in  any  double-diffusive  process. 

A  significant  sub-theme  which  emerged  on  this  and 
other  issues  concerns  the  extent  to  which  different  stirring 
and  mixing  processes  are  in  parallel  or  in  series.  If  two 


processes  are  in  parallel,  their  strengths  may  be  compared 
to  decide  which  is  more  important.  If  processes  are  in  se¬ 
ries,  one  might  need  to  know  which  is  the  rate-controlling 
one.  In  the  usual  case  of  the  stirring  of  a  tracer  in  a  homo¬ 
geneous  fluid,  molecular  mixing  clearly  operates  in  series, 
but  with  the  ultimate  mixing  rate  being  controlled  by  the 
turbulent  stirring.  In  the  ocean,  where  there  might  be  some 
intermediate  process  between  the  stirring  agents  and  the 
final  molecular  diffusion,  the  answer  may  be  more  compli¬ 
cated.  For  example,  the  flux  of  warm  water  across  the  Gulf 
Stream  may  be  controlled  by  the  amount  of  air-sea  interac¬ 
tion  that  occurs  in  warm-core  rings  before  they  are  reab¬ 
sorbed  by  the  Stream;  the  flux  is  not  necessarily  controlled 
just  by  the  ring  formation  rate.  The  general  issue  of  series 
or  parallel  processes  needs  to  be  borne  in  mind  in  any  at¬ 
tempt  to  focus  on  processes  that  are  important,  and  may  be 
aided  by  a  triple  decomposition  into  mean,  eddies  and  tur¬ 
bulence  (Figure  4). 


Eddies 


Turbulence  — . ^  Dissipation 


Figure  4.  A  simple  schematic  of  the  path  of  scalar  concentration 
variance  from  mean  to  dissipation  via  eddies  and  turbulence. 
(Courtesy  of  C.  Garrett). 


Processes  in  the  surface  mixed  layer 

Because  of  intense  vertical  mixing  there,  the  surface 
mixed  layer  of  the  ocean  provides  many  examples  of  the 
interplay  of  stirring  and  mixing  processes.  One  such  proc¬ 
ess  relies  on  the  fact  that  the  density  of  sea  water  is  deter¬ 
mined  by  both  temperature  and  salinity. 

Typical  of  the  surface  mixed  layer  are  compensated 
fronts  that  are  warm  and  salty  on  one  side  and  cold  and 
fresh  on  the  other  side  such  that  the  density  contrast  is 
small.  Figure  5  shows  an  example  along  a  longitude  line  in 
the  subtropical  North  Pacific.  The  existence  of  the  fronts 
has  been  rationalized  as  follows  {Young,  1994):  Nonho- 
mogeneous  atmospheric  forcing  and  entrainment  of  ther- 
mocline  waters  generate  horizontal  gradients  in  tempera¬ 
ture,  salinity,  and  density.  Density  fronts  slump  by  the  ac¬ 
tion  of  horizontal  density-driven  shear  currents  which  are 
eventually  arrested  by  vertical  mixing.  This  process  can  be 
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Figure  5.  Potential  temperature  (red  line),  salinity  (blue  line)  and 
potential  density  (black  line)  from  a  horizontal  Sea  Soar  tow  at 
50  m  in  the  subtropical  North  Pacific,  at  140°W,  between  28  and 
30°N.  The  depth  is  in  the  middle  of  the  local  mixed  layer.  The 
vertical  axes  for  temperature  and  salinity  are  scaled  by  the  expan¬ 
sion  coefficients  so  that  excursions  of  temperature  and  salinity 
show  the  change  they  imply  for  density.  (Courtesy  of  R.  Ferrari). 

thought  of  as  shear  dispersion.  Compensated  fronts  do  not 
experience  such  shear  dispersion.  The  result  is  that  density 
fronts  diffuse  whereas  compensated  fronts  persist.  The 
ubiquitous  existence  of  compensated  fronts  suggest  that 
horizontal  mixing  in  the  surface  mixed  layer  is  very  much 
larger  for  density  than  for  “spice”,  the  combination  of 
temperature  and  salinity  that  is  “orthogonal”  to  density. 
Indeed,  eddy-resolving  models  that  diffuse  temperature 
and  salinity  and  hence  density  and  spice  with  the  same 
coefficient  are  unable  to  reproduce  the  observed  tendency 
towards  thermohaline  compensation. 

Air-sea  fluxes  and  the  behavior  of  the  surface  mixed 
layer  are  also  involved  in  a  thermodynamic  approach  to 
water-mass  formation.  Basically,  water  must  flow  across 
isopycnals  to  balance  surface  buoyancy  loss  or  gain,  and 
the  convergence  of  this  volume  flux  provides  an  estimate 
of  the  subduction  rate.  This  rate  may  be  compared  with  a 
dynamically-based  prediction  from  Ekman  convergence. 
Both  the  thermodynamic  and  dynamic  approaches,  how¬ 
ever,  require  allowance  for  the  role  of  eddies  and  diurnal 
and  seasonal  cycling  of  the  mixed  layer  depth.  The  prob¬ 
lem  is  thus  complicated  by  the  interplay  of  various  stirring 
and  mixing  processes  which  need  to  be  understood. 

The  stirring  and  mixing  of  non-conservative 
tracers 

Remote  sensing  of  the  sea  surface  from  space  has  made 
us  aware  of  beautiful  and  complex  patterns  in  biologically 
active  variables  such  as  chlorophyll  (Figure  6).  One  ques¬ 
tion  is  whether  the  observed  patchiness  is  a  consequence  of 
biological  processes  or  merely  of  the  passive  stirring  of 
large-scale  gradients.  Some  clues  come  from  comparing 
the  shapes  of  the  wavenumber  spectra  of  active  and  pas- 
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sive  scalars,  such  as  chlorophyll  and  temperature  respec¬ 
tively.  If  the  former  shows  comparatively  more  variance  at 
small  scales,  this  is  evidence  for  the  importance  of  scalar 
growth  processes. 

Accounting  for  such  patchiness  in  a  model  is  particu¬ 
larly  difficult  if  the  non-conservative  processes  are  nonlin¬ 
ear.  In  that  case  it  is  clearly  inadequate  to  proceed  with  a 
model  that  deals  only  with  an  ensemble  average  variable;  a 
Lagrangian  approach  which  follows  the  development  of 
individual  patches  is  necessary.  A  very  basic  model  of  this 
kind  is  the  “Brownian  bug  model”.  It  describes  an  ensem¬ 
ble  of  bugs  that  perform  random  walks  and  experience 
random  birth  and  death  processes  along  the  way.  Mathe¬ 
matically,  this  is  a  “superprocess.”  A  homogeneous  initial 
distribution  spontaneously  develops  patches  and  voids. 


Figure  6.  Chlorophyll  concentration  in  the  Tasmanian  Sea  as 
observed  by  SeaWiFS,  (Courtesy  of  B.L.  Hua). 

Examining  data  for  evidence  of  patchiness  or  intermit- 
tency  can  be  aided  by  examination  of  high  order  structure 
functions.  For  a  variable  C(x,  t)  the  ^th  order  structure 
function  is  defined  as  (8(C(1))  ^),  where  8C(/)  is  the  differ¬ 
ence  in  C  over  a  separation  /  and  ()  denotes  an  ensemble 
average.  The  structure  function  exponent  ^(^),  defined  by 
fitting  a  power  law  to  the  structure  function,  character¬ 
izes  the  degree  of  intermittency  for  multifractal  processes. 
It  is  a  linear  function  of  q  for  non-intermittent  processes 
and  becomes  a  nonlinear  concave  function  for  intermittent 
processes.  Calculation  of  this  exponent  from  oceanic  ob¬ 
servations  suggests  that,  in  the  scale  range  of  nearly  iso¬ 
tropic  turbulence,  passive  scalars  such  as  temperature  are 
more  intermittent  than  the  advective  velocity  field.  This  is 
a  result  also  obtained  in  numerical  simulations  of  3D  ho¬ 
mogeneous  turbulence  (e.g.,  Jimenez,  2000).  Biologically 
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active  tracers  show  intermittency  beyond  that  associated 
with  turbulent  stirring.  The  excess  increases  with  decreas¬ 
ing  turbulence  intensity  (Figure  7).  These  observational 
results  need  to  be  corroborated  and  understood.  Further  to 
this,  the  possibility  of  a  relationship  between  biological 
and  physical  variables  can  be  studied  through  examination 
of  the  behavior  of  joint  structure  functions,  though  caution 
is  required  to  avoid  over-interpretation  of  artifacts  intro¬ 
duced  by  particular  sampling  schemes  or  noise. 


q 


Figure  7.  Empirical  values  of  the  structure  function  exponent 
^{q)  for  temperature  (circles)  and  in  vivo  fluorescence  for  differ¬ 
ent  values  of  the  turbulent  dissipation  rate  £  (red  £  =  10“^  m^  s"^; 
orange  £  =  5  x  10"^  m“  s"^;  blue  £  =  10“^  m^  s“^;  dark  green  £  =  5 
xl  0“^  m^  s'^;  light  green  £  =  10”^  m^  s“^).  The  linear  curve  (dotted 
line)  is  expected  for  non-interm ittent  turbulence.  The  data  were 
collected  in  the  eastern  English  Channel.  (Courtesy  of  L.  Seu- 
ront). 


Mixing  efficiency 

Much  of  the  diapycnal  mixing  in  the  ocean  is  associated 
with  the  shear  instability  of  internal  waves.  The  subsequent 
overturning  and  stirring  leads  to  finescale  patterns  of  den¬ 
sity  which  then  disappear  through  the  action  of  molecular 
thermal  diffusivity  or  the  molecular  diffusion  of  salt.  The 
turbulent  eddies  are  being  dissipated  by  molecular  viscos¬ 
ity  at  the  same  time  as  they  are  working  against  buoyancy 
forces;  a  major  question  concerns  the  relative  fractions  of 
turbulent  kinetic  energy  lost  to  dissipation  at  a  rate  e  and 
converted  to  mean  potential  energy  at  a  rate  KyN'^,  where 
Ky  is  the  eddy  diffusivity  and  N  the  buoyancy  frequency. 
The  ratio  T  =  is  generally  thought  to  be  about  0.2, 

largely  based  on  calibration  against  other  methods  of  esti¬ 


mating  Ky.  There  is  no  reason  to  suppose  that  T  is  a  univer¬ 
sal  constant,  however,  but  it  has  not  been  established  what 
environmental  parameters  might  determine  its  value. 

Some  evidence  reported  at  the  meeting,  based  largely  on 
a  laboratory  experiment  but  with  some  field  support,  sug¬ 
gests  that  r  is  less  than  the  value  of  about  0.2  quoted 
above  if  the  turbulence  is  weak  with  a  “turbulent  Reynolds 
number”  Rej^^  zNN'^  less  than  about  100.  This  is  not  sur¬ 
prising:  weak  turbulence  dies  away  before  it  has  accom¬ 
plished  much  buoyancy  flux.  What  was  a  surprise  was  a 
claim  that  T  also  falls  off  again  for  values  of  Rer  greater 
than  about  1,000.  This  dependence  is  hard  to  accept;  it 
may  transpire  that  Rer  in  these  experiments  was  actually  a 
proxy  for  an  input  variable  that  does  not  involve  the  vis¬ 
cosity  V.  One  variable  that  might  lead  to  reduced  T  in  the 
ocean  is  the  duration  of  high  shear  compared  with  the 
buoyancy  time  N~\  A  long-lived  mixing  event  might  pro¬ 
duce  all  its  buoyancy  flux  at  first  and  subsequently  stir 
water  that  is  already  mixed,  dissipating  energy  but  with 
little  further  buoyancy  flux.  More  study  is  required,  with 
high  resolution  3D  numerical  simulations  of  breaking  in¬ 
ternal  wave  fields  becoming  increasingly  feasible  and  in¬ 
formative. 

The  question  of  mixing  efficiency  is  important  in  any 
assessment  of  the  amount  of  mixing  that  might  be  associ¬ 
ated  with,  for  example,  the  energy  loss  from  the  barotropic 
tide  to  internal  tides.  There  are  also  implications  for  the 
circulation  that  is  diagnosed  from  measured  dissipation 
rates  £  and  the  observed  mean  density  structure.  The  mean 
diapycnal  velocity  w.  may  be  obtained  from  the  mean 
buoyancy  equation  (e.g.,  St.  Laurent  ef  aL,  2001) 

w./V“  -  d(rE)/dz 

so  that  the  value  of  w,  can  depend  on  gradients  of  T.  Vor¬ 
tex  stretching  given  by  fdwjdz  then  affects  the  mean  flow 
along  isopycnals. 

Differential  mixing 

The  rapid  decay  of  turbulent  kinetic  energy  at  small 
values  of  the  turbulent  Reynolds  number  Rej  =  e/vV‘  can 
also  lead  to  less  turbulent  transport  of  salt  than  heat,  given 
the  very  small  molecular  diffusivity  of  the  former.  Al¬ 
though  the  effect  is  clearly  established  in  2D  and  3D  nu¬ 
merical  simulations  and  in  laboratory  experiments  that 
mimic  the  breaking  of  internal  waves  in  a  stratified  ocean, 
it  is  somewhat  counterintuitive  at  first  sight.  If  the  turbu¬ 
lent  fluxes  were  downgradient  at  all  scales  then  the  more 
diffusive  tracer  would  lose  its  higher  wavenumber  fluctua¬ 
tions  and  would  experience  less  turbulent  transport.  It 
seems,  however,  that  the  fluxes  are  upgradient  at  high 
wavenumbers,  as  a  simple  consequence  of  the  secondary 
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instability  and  collapse  of  an  overturning  billow.  The  more 
diffusive  u-acer  thus  loses  some  upgradient  flux  and  there¬ 
fore  experiences  more  downgradient  transport. 

The  importance  of  differential  mixing  for  the  ocean  is 
not  yet  known.  It  may  turn  out  that  in  regions  where  mix¬ 
ing  matters  it  is  at  high  enough  Rej  that  the  eddy  diffusivi- 
ties  of  heat  and  salt  are  the  same.  This  is  particularly  likely 
to  be  true  given  the  intermittent  nature  of  mixing  in  the 
ocean  compared  with  the  rather  steady  stirring  in  the  nu¬ 
merical  experiments;  /Jer  based  on  e  during  a  mixing  event 
in  the  ocean  may  be  much  larger  than  that  based  on  the 
average  8. 

Different  rates  of  mixing  for  heat  and  salt  are,  however, 
firmly  established  for  regions  of  the  ocean  where  double- 
diffusive  processes  are  significant.  These  regions  include 
places,  such  as  the  C-SALT  region  near  Barbados 
{Schmitt,  \99A),  where  the  vertical  profiles  of  temperature 
and  salinity  are  nearly  compensating  in  density,  and  places 
such  as  the  Arctic  Ocean  where  internal  wave  activity  is 
low.  Even  in  these  regions  there  are  significant  puzzles, 
partly  because  laboratory  results  seem  not  to  be  readily 
applicable,  and  partly  because  we  lack  adequate  theoretical 
guidance.  One  obvious  problem  is  to  explain  the  step  size 
in  thermohaline  staircases.  In  the  diffusive  regime  (with 
warm  salty  water  below  cooler  fresher  water),  some  useful 
predictions  come  from  a  local  water  column  model, 
whereas  in  the  salt  fingering  regime  (with  the  opposite 
stratification)  it  seems  that  the  step  size  might  be  set  by 
lateral  intrusive  processes.  The  issue  is  certainly  not  fully 
resolved,  and  the  interfaces  in  the  fingering  regime  also 
seem  more  complex  in  the  ocean  than  in  the  laboratory. 

Importantly,  many  regions  of  the  ocean  which  might 
support  double-diffusive  processes  seem  to  have  these 
disrupted  by  things  like  the  shear  of  internal  waves,  with 
the  breaking  of  these  waves  then  dominating  diapycnal 
transports.  Even  these  regions  can  show  some  evidence  for 
double  diffusive  transports,  however,  so  that  the  phenom¬ 
ena  warrant  continued  attention. 

Summary 

There  are  typically  nearly  10“  spatial  degrees  of  free¬ 
dom  between  the  smallest  scales  being  resolved  by  nu¬ 
merical  models  of  ocean  circulation  and  the  smallest  scales 
of  variability.  The  meeting  was  therefore  being  ambitious 
in  attempting  to  survey  a  vast  range  of  scales  and  phenom¬ 
ena.  Some  general,  if  rather  platitudinous,  principles  do 
seem  clear:  (1)  Studying  the  processes  at  work  in  the  ocean 
is  worthwhile  science.  (2)  There  is  a  need  not  just  to  un¬ 
derstand  these  processes  but  also  to  parameterize  them  in  a 
form  that  can  be  implemented  in  models  for  the  large- 
scale  behavior  of  the  ocean.  (3)  These  models  should  "be 
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used  in  sensitivity  tests  to  determine  which  parameteriza- 
tions  need  improvement. 

It  is  not  really  clear  what  issues  have  the  highest  prior¬ 
ity.  After  a  sometimes  tumultuous  discussion  the  partici¬ 
pants  arrived  at  the  following  ranking  by  vote: 

1 .  What  are  the  constraints  on  eddy  parameterization 
and  how  should  they  be  implemented? 

2.  What  nonlocal  effects  matter  and  how  can  they  be 
parameterized? 

3.  How  uniform  are  stirring  and  mixing  processes?  Are 

there  any  “hotspots,”  particularly  in  association 
with  topographic  features? 

4.  What  are  the  effects  of  diapycnal  boundary 

processes  on  eddy  parameterization? 

5.  What  is  the  parameter  dependence  of  the  mixing 

efficiency  or  flux  Richardson  number? 

6.  How  does  energy  leak  from  the  slow  manifold? 

7.  What  aspects  of  stirring  and  mixing  affect  biological 

processes?  What  do  patterns  of  biological  tracers 
tell  us  about  fluid  dynamics? 

8.  Under  what  circumstances  do  we  need  different 

diffusivities  for  different  tracers? 

9.  What  are  the  effects  of  adiabatic  stirring  on 

diapycnal  mixing? 

10.  Do  subgridscale  processes  provide  stochastic 
forcing? 

The  last  question  addresses  a  very  fundamental  aspect 
of  the  Reynolds  decomposition.  A  second  fundamental 
aspect  is  that  the  simple  Reynolds  decomposition  ideally 
requires  a  spectral  gap,  in  time  or  space,  separating  the 
unresolved  processes  to  be  parameterized  from  the  slowly 
changing  “mean.”  Both  these  issues  were  discussed  at  the 
meeting,  but  without  any  profound  new  insights. 

Stirring  and  mixing  in  a  stratified  ocean  is  the  physics 
that  needs  to  be  parameterized  in  ocean  circulation  models. 
Challenging  open  problems  remain  at  all  levels,  from  very 
fundamental  to  highly  applied  aspects. 
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